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PREFACE 


Tue Aam of this edition, as of the first edition, is to provide an under- 
standing of each of the major learning theories in relation to the experi- 
ments to which that theory gives rise. The years since the first edition 
have seen an enormous expansion of psychological research in learning 
and no dearth of theoretical contributions. The task of providing an intro- 
ductory exposition that is at once clear and critical becomes increasingly 
difficult. 

I have chosen to deal with the major positions that flourished in the 
first half of the twentieth century in much the way they were dealt with 
in the first edition, building the exposition around the psychologists who 
gave currency to the several theories. Thus the chapters on Thorndike, 
Guthrie, Hull, Skinner, Tolman, classical gestalt, Lewin, and functional- 
ism are repeated, though each has been thoroughly rewritten. New chap- 
ters include one on the learning significance of Freud’s psychodynamics, 
one on mathematical models, and two on other current developments. 
The over-all result is a book of one-third more pages. Despite the orienta- 
tion toward historical antecedents of current theory, nearly half the 
literature cited is new to this edition. 

In the effort to keep the book from becoming unduly large, some of 
the material from the first edition has not been incorporated in the re- 
vision, The chapter on Wheeler's views is omitted with the regret that 
neither he nor any of his followers has revised the position sufficiently to 
keep it currently active. Much of the material from the dropped chapter 
on theories influenced by field conceptions has been treated in other ways, 
but some of it has been lost in the revision. So, too, Hull’s 1943 postulates 
have had to give way to allow a critical exposition of his final 1952 system. 
For some purposes, therefore, the present edition does not completely re- 
place the earlier one, and occasional references are made in the new 
edition to non-repeated material in the earlier one. 

The two chapters on current developments recognize that the ad- 
vances in mid-century cannot be confined to the standard positions that 
became defined in the first half of the century. Younger psychologists 
who began as students of one or another of the masters have matured as 


theorists in their own right and can no longer be subordinated as disciples 
in 
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or defenders of older views. In the new chapters I have tried to trace the 
connections with the earlier positions, while recognizing that we are mov- 
ing forward, and debate is no longer confined to old issues. Some mem- 
bers of the new generation of theorists no doubt now deserve chapters in 
their own right; but until the new viewpoints are somewhat better crys- 
tallized, the necessary distinctions would undoubtedly prove confusing 
to the student who is trying for the first time to discover the identifying 
characteristics of the longer established theories. 

It is my hope that this book may assist the student of the psychology 
of learning and his teachers to find their way into the current experi- 
mental and theoretical literature of learning—a literature now increas- 
ingly forbidding without a guide. While my task is expository and not 
polemical, I have not refrained from expressing personal judgments when 
these seemed appropriate to me. I hope, however, that these personal in- 
trusions will be recognized for what they are, and that they remain dis- 
tinguishable from the more objective exposition of the theories here con- 
sidered. In the final chapter on retrospect and prospect I have departed 
from the expository style of the earlier chapters in order to estimate where 
we now stand on the more important isssues and to indicate the paths 
that seem likely to lead to clarification and ultimate agreement. 

Suggestions that have changed the second edition have come from 
many students, colleagues, and critics, most of whom will have to go un- 
named. I cannot refrain from mentioning the patient reading and helpful 
comments of the editors of this series, R. M. Elliott and Kenneth Mac- 
Corquodale. Individual chapters benefited through critical reading by 
Douglas H. Lawrence, Robert R. Sears, Morris I. Stein, Patrick C. Suppes, 
and Thurlow R. Wilson. I am grateful for permission to use as yet un- 
published materials from Neal E. Miller, O. Hobart Mowrer, B. F. Skin- 
ner, and Kenneth W. Spence. The typing was done by Miss Irene Payne 
and the new charts drafted by P. Stanley King. 

The following publishers permitted the reproduction of figures or 
quotations: Appleton-Century-Crofts, Inc., Cambridge University Press, 
Columbia University Press, Cornell University Press, Duke University 
Press, Free Press, Harper and Brothers, Harvard University Press, Ho- 
garth Press, Henry Holt and Company, Houghton Mifflin Company, 
Kegan Paul, Trench, Trubner and Company, Liveright Publishing Corpo- 
ration, McGraw-Hill Book Company, Inc., The Macmillan Company, 
National Society for the Study of Education, Prentice-Hall, Rinehart and 
Company, Ronald Press, Stanford University Press, Teachers College, 
Columbia University, University of Oregon Press, John Wiley and Sons, 
Inc., Yale University Press. The publishers of the following journals and 
monograph series have permitted quotations from these sources: Ameri- 
can Journal of Psychology, American Psychologist, Contributions to Psy- 
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chological Theory, Genetic Psychology Monographs, Journal of Abnormal 
and Social Psychology, Journal of Comparative Psychology, Journal of 
Comparative and Physiological Psychology, Journal of Experimental Psy- 
chology, Journal of General Psychology, Journal of Psychology, Psycho- 
logical Bulletin, Psychological Review, University of California Publica- 
tions in Psychology, University of Iowa Studies in Child Welfare. The 
original source is cited in each case. 


Stanford, California 
E.R.H. 
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THEORIES of LEARNING 


Chapter 1 


THE NATURE OF LEARNING 
THEORIES 


HE scientific study of learning is carried on primarily by psycholo- 

gists. Psychology’s claim to the field was staked in part by masterly 
pioneers such as Ebbinghaus (1885),* Bryan and Harter (1897, 1899), 
and Thorndike (1898). Those who have followed in their footsteps have 
been primarily psychologists. Professional educators have welcomed edu- 
cational psychology as a foundation science upon which to build their 
practices, and studies of learning have gone on concurrently in labora- 
tories of general psychology and laboratories of educational psychology, 
with interplay between the pure and applied fields. Under the circum- 
stances, it is very natural for psychologists to feel that the study of 
learning belongs to them. 

In addition to historical reasons, there is another basis on which to 
account for the psychologist’s interest in learning. This is the centrality 
of learning in the more general systems of psychological theory. A scien- 
tist, along with the desire to satisfy his curiosity about the facts of nature, 
has a predilection for ordering his facts into systems of laws and theories. 
He is interested not only in verified facts and relationships, but in neat 
and parsimonious ways of summarizing these facts. Psychologists with 
a penchant for systems find a theory of learning essential because so 
much of man’s diverse behavior is the result of learning. If the rich 
diversity of behavior is to be understood in accordance with a few princi- 
ples, it is evident that some of these principles will have to do with the 
way in which learning comes about. 

Many psychologists make explicit acknowledgment of the centrality 
of learning in their broader systems. Three examples may be cited. 

1. In his definition of behavior as molar rather than as molecular (a 


1 References cited can be found by author and date in the list at the end of the book. 
1 
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distinction that lies at the very heart of his system), Tolman? lists docility 
(“teachableness”) as the crowning characteristic of such behavior. All 
molar behavior is docile. Hence learning becomes for Tolman an iden- 
tifying character of that which he wishes to include as behavior. 

2. Guthrie makes of learning the mark of mind. As he puts it: 


The ability to learn, that is, to respond differently to a situation because of 
past response to the situation, is what distinguishes those living creatures which 
common sense endows with minds. This is the practical descriptive use of the 


term “mind.” # 

8. Hull,‘ in introducing his theory of the behavior sciences, finds it 
natural to devote the first volume of his series to learning theory. He 
scarcely distinguishes between a theory of learning and a theory of be- 
havior, so important is learning in his conception of behavior. While his 
books are primarily about learning, their titles reflect the intended gen- 
erality: Principles of behavior, A behavior system. 

Although not all psychologists give this same prominence to learn- 
ing in their theories, the fact that others do makes it imperative for all to 
treat the problems of learning in one way or another. Hence the sys- 


tematic aspects of learning theory have come to be important to all 
psychologists interested in more general theories. 


THE DEFINITION OF LEARNING 
What Learning Includes 


There are many activities which everyone will agree count as il- 
lustrations of learning: acquiring a vocabulary, memorizing a poem, 
learning to operate a typewriter. There are other activities, not quite as 
obviously learned, which easily classify as learned, once you have re- 
flected upon them. Among these are the acquiring of prejudices and 
preferences and other social attitudes and ideals, including the many 
skills involved in the social interplay with other people. Finally there 
are a number of activities whose acquisition is not usually classifiable as 
a gain or improvement because their utility, if such there be, is not 
readily demonstrable. Among these are tics, mannerisms, autistic gestures. 

Such a pointing to illustrations of learning serves very well as a first 
approximation to a definition. It is, in fact, extremely difficult to write 
an entirely satisfactory definition. Although we are tempted to define 

2 Tolman (1932b), pages 14-16. 


3 Guthrie (1952), page 3. 
4 Hull (1943), 
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learning as improvement with practice, or as profiting by experience, we 
know very well that some learning is not improvement, and other learn- 
ing is not desirable in its consequences. To describe it alternatively as any 
change with repetition confuses it with growth, fatigue, and other changes 
which may take place with repetition. The following definition may be 
offered provisionally: 

Learning is the process by which an activity originates or is changed through 
reacting to an encountered situation, provided that the characteristics of the 
change in activity cannot be explained on the basis of native response tendencies, 
maturation, or temporary states of the organism (e.g., fatigue, drugs, etc.). 

The definition is not formally satisfactory because of the many un- 
defined terms in it, but it will do to call attention to the problems involved 
in any definition of learning. The definition must distinguish between (1) 
the kinds of changes, and their correlated antecedents, which are in- 
cluded as learning, and (2) the related kinds of changes, and their 
antecedents, which are not classified as learning. We can now go on to 
consider some of the changes that are excluded by our provisional defini- 


tion. 


Native Response Tendencies versus Learning 

The older catalogs of innate behavior usually included among un- 
learned activities the reflexes (such as pupillary constriction to light), 
the tropisms (such as a moth’s dashing into a flame), and the instincts 
(such as a bird’s nest-building). We may continue to acknowledge such 
activities as species characteristics. After a period during which it was 
rather emotionally tabooed, the term “instinct” is again part of the psy- 
chologist’s vocabulary, referring to complex unlearned activity charac- 
teristic of a species.” 

The problem of instinct versus learning is not solved by attempting 
to classify some behavior as instinctive in its entirety and other behavior 
as learned in its entirety. The in-between nature of behavior with large 
instinctive components is well illustrated by the experiments on im- 
printing.» A young duckling, for example, is prepared instinctively to 
accept a certain range of mother-figures, characterized by size, move- 
ment, and vocalization. Once such a mother-figure has been accepted 
and followed about, only this particular mother now satisfies the instine- 
tive demand. The selected mother (who may be Professor Lorenz crawl- 
ing on hands and knees) has become imprinted, and is the only “mother” 


5 Tinbergen (1951). 
€ E.g., Lorenz (1952), Ramsay (1951). 
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the duckling will now follow. Imprinting is then a form of learning, but 
a form very closely allied to the instinctive propensities of a particular 
kind of organism of a particular age. (We lack at present evidence for 
imprinting except in fowls and other birds.) The problem of distinguish- 
ing between the instinctive and learned components of the behavior 
illustrated by imprinting is an experimental problem, depending for its 
clarification upon the ingenuity of the experimenter in designing ap- 
propriate control experiments. 


Maturation versus Learning 


Growth is learning’s chief competitor as a modifier of behavior. If 
a behavior sequence matures through regular stages irrespective of inter- 
vening practice, the behavior is said to develop through maturation and 
not through learning. If training procedures do not speed up or modify 
the behavior, such procedures are not causally important and the changes 
do not classify as learning. Relatively pure cases like the swimming of 
tadpoles and the flying of birds can be attributed primarily to matura- 
tion. Many activities are not as clear-cut, but develop through a complex 
interplay of maturation and learning. A convenient illustration is the de- 
velopment of language in the child. The child does not learn to talk 
until old enough, but the language which he learns is that which he hears. 
In such cases it is an experimental problem to isolate the effects of 


maturation and of learning. The ambiguity in such cases is one of fact, 
not of definition. 


Fatigue versus Learning 


When activities are repeated in rapid succession, there is often a 
loss in efficiency commonly attributed to fatigue. Such changes in per- 
formance are called work decrements in the experimental laboratory. 
The units of a work curve are like those of a practice curve: performance 
plotted against trials or repetitions. Hence the experimental arrangements 
in obtaining a work curve are essentially those of a learning procedure 
and, at first sight, it appears to be a form of question-begging to define 
the processes involved by the results obtained. It would be question- 
begging, however, only if we were to equate learning or fatigue with the 
change in performance. Actually both learning and fatigue are inferences 
from the performances, and it is permissible to make such inferences as 
the obtained performances require or suggest. Fatigue curves tend to 
show decreasing proficiency with repetition and recovery with rests, 
Learning curves ordinarily show gains with repetitions and forgetting 
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over rests, These typical differences between learning effects and fatigue 
effects are evident enough, but the inferences from performance are made 
on somewhat more complex evidence. It is because of the complexity of 
these inferences that it is difficult to state a concise definition of learning 
which will conserve the learning inferences from performance while 
eliminating the fatigue inferences. The problem is logically the same as 
distinguishing changes due to maturation and to learning. But again the 
ambiguity is one of fact, not of definition. 

Learning always must remain an inference from performance, and 
only confusion results if performance and learning are identified. A clear 
illustration is provided by performance under the influence of drugs or 
intoxicants. The fact that learned behavior fails when the organism is in 
such a state does not mean that forgetting has occurred. When the normal 
state has been restored, the performance may return to normal levels 
although there has been no intervening training. 


Learning and the Nervous System 


Some definitions of learning avoid the problem of performance by 
defining learning as a change in the central nervous system. So long as 
this change in the nervous system persists, temporary changes in state, 
such as those in fatigue and intoxication, affect performance but not 
learning. This definition asserts that learning is an inference, but it goes 
on to make a particular sort of inference about the role of the nervous 
system in learning. In view of the lack of knowledge of what actually 
does take place inside the organism when learning occurs, it is preferable 
not to include hypothetical neural processes in the definition of learning. 
We know that learning takes place. We should therefore be able to define 
what we are talking about without reference to any speculation what- 
ever, This position does not deny that what we are calling learning may 
be a function of nervous tissue. It asserts only that it is not necessary to 
know anything about the neural correlates of learning in order to know 


that learning occurs. 


Learning, Problem-Solving, and Reasoning 

After you have learned, there are many things which you are able 
to do. If you can add and subtract, you can solve many novel problems 
without learning anything new. Where the solution of problems is rela- 
tively mechanical (as in addition and subtraction), the problem may 
be thought of as merely the exercise or utilization of a learned bit of be- 
havior. When, however, there is greater novelty, more putting of things 
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into relationship, as in reasoning or inventiveness, the process is interest- 
ing in its own light, and is not to be described simply as the running 
off of old habits. 

The question has been raised, especially by Maier,” as to the appro- 
priateness of including processes like reasoning within the same classifi- 
cation as other kinds of learning. My preference is for including them. 
Leaving them in does not prejudge their explanation. There may be 
several kinds of learning from the simpler to the more complex, not all 
following the same principles. If so, we have no assurance that the only 
sharp break comes when “reasoning” appears. Leaving the doubtful 
processes in simply asserts that a complete theory of learning must 
have something to say about reasoning, creative imagination, and in- 
ventiveness, in addition to what may be said about memorizing and 
retaining or about the acquisition of skill. 


Definition Not a Major Source of Disagreement Between 
Theories 


While it is extremely difficult to formulate a satisfactory definition 
of learning so as to include all the activities and processes which we wish 
to include and eliminate all those which we wish to exclude, the diffi- 
culty does not prove to be embarrassing because it is not a source of 
controversy as between theories. The controversy is over fact and in- 
terpretation, not over definition. There are occasional confusions over 
definition, but such confusions may usually be resolved by resort to 
pointing, to denotation. For the most part it is satisfactory to continue to 
mean by learning that which conforms to the usual socially accepted 
meaning that is part of our common heritage. Where distinctions have 


to be made with greater precision, they can be made through carefully 
specified types of inference from experiments, 


SOME TYPICAL PROBLEMS CONFRONTING 
LEARNING THEORIES 


The preferences of the theorist often lead him to concentr 
one kind of learning situation to the neglect of the others, His 
then appropriate to this situation, but becomes somewhat strained in rela- 
tion to other problems of learning. A comprehensive learning theory 
ought to answer the questions which an intelligent non-psychologist 
might ask about the sorts of learning which are met in everyday life, 

T Maier (19381), 


ate upon 
theory is 
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A few such questions will be listed here, and then used later in apprais- 
ing the theories which different writers present. 

l. What are the limits of learning? Here is raised the question of 
the capacity to learn, of individual differences among learners of the 
same species and of unlike species. There are questions not only of per- 
sistent differences in capacity, but of change in capacity with age. Who 
can learn what? Are the limits set at birth? Do people get more or less 
alike with practice? These are the sorts of questions which it is natural 
to raise. 

2, What is the role of practice in learning? The old adage that 
practice makes perfect has considerable racial wisdom behind it. Surely 
one learns to roller skate or to play the piano only by engaging in the 
activity. But what do we know about practice in detail? Does improve- 
ment depend directly on the amount of repetition? If not, what are its 
conditions? What are the most favorable circumstances of practice? 
Can repetitive drill be harmful as well as helpful to the learner? 

8. How important are drives and incentives, rewards and punish- 
ments? Everybody knows in a general way that learning can be con- 
trolled by rewards and punishments, and that it is easier to learn some- 
thing which is interesting than something which is dull. But are the con- 
sequences of rewards and punishments equal and opposite? Is there a 
difference between intrinsic and extrinsic motives in their effect upon 
learning? How do goals and purposes affect the process? 

4. What is the place of understanding and insight? Some things are 
learned more readily if we know what we are about. We are better off 
as travelers if we can understand a timetable or a road map. We are 
helpless with differential equations unless we understand the symbols 
and the rules for their manipulation. But we can form vowels satisfac- 
torily without knowing how we place our tongues, and we can read 
without being aware of our eye movements. Some things we appear to 
acquire blindly and automatically; some things we struggle hard to 
understand and can finally master only as we understand them. Is learn- 
ing in one case different from what it is in the other? 

5. Does learning one thing help you learn something elseP This is 
the problem of formal discipline, as it used to be called, or of transfer 
of training, to use a more familiar contemporary designation. Some 
transfer of training must occur or there would be no use in developing 
a foundation for later learning. Nobody denies that it is easier to build 
a vocabulary in a language after you have a start in it, or that higher 
mathematics profits from mastery of basic concepts. The question is really 
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one of how much transfer takes place, under what conditions, and what 
its nature is. 

6. What happens when we remember and when we forget? The 
ordinary facts of memory are mysterious enough, but in addition to 
familiar remembering and forgetting, our memories may play peculiar 
tricks on us. Some things we wish to remember are forgotten; some 
things we would be willing to forget continue to plague us. In cases 
of amnesia there are often gaps in memory, with earlier and later events 
remembered. Then there are the distortions of memory, in which we re- 
member what did not happen, as is so strikingly demonstrated in testi- 


mony experiments. What is taking place? What control have we over 
processes involved? 


These six questions will serve as useful ones to ask of each of the 
major theories. They suffice to illustrate the kinds of questions which 
give rise to theories of learning. 


ISSUES ON WHICH LEARNING THEORIES DIVIDE 


In the preceding section we asked certain common-sense questions 
about learning, on the assumption that a good learning theory should 
have something to say about each of them. Such questions can be raised 
before we know anything about actual learning theories. Now we wish 
to turn, however, to certain issues that have arisen in the formulation of 
actual theories. By alerting us in this way to what is to follow, we are 
better prepared for some of the differences in flavor that we shall meet 
as we review one theory after another. 

Learning theories fall into two major families: stimulus-response 
theories and cognitive theories, but not all theories belong to these two 
families. The stimulus-response theories include such diverse members 
as the theories of Thorndike, Guthrie, Skinner, and Hull. The cognitive 
theories include at least those of Tolman, the classical gestalt psycholo- 
gists, and Lewin. Not completely and clearly classifiable in these terms 
are the theories of functionalism, psychodynamics, and the probabilistic 
theories of the model builders. The lines of cleavage between the two 
families of theories are not the only cleavages within learning theories; 


there are other specific issues upon which theories within one family may 
differ, 
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General Issues Producing a Cleavage Between Stimulus- 
Response and Cognitive Theories 


The cleavages between the theorists of opposing camps are diffi- 
cult to understand because many of the distinctions which at first seem 
to contrast sharply later are found to be blurred. All reputable theorists 
accept a common logic of experimentation, so that disagreements over 
experimentally obtained facts are readily arbitrated. In the end, all the 
theorists accept a common body of demonstrated relationships, at the 
factual or descriptive level; any theorist who denied an established fact, 
a reproducible experimental finding, would lose status among his scien- 
tific colleagues, and his theories would no longer command respect. The 
first rule that we must be prepared to accept, as we judge the relative 
merits of different theories is this: All the theorists accept all of the facts. 
Some experimental findings are doubted when they are first announced 
and the status of findings as fact may for a long time be doubted; but 
once the status as fact is established, all accept the fact as true. Hence 
the differences between two theorists are primarily differences in inter- 
pretation. Both theories may fit the facts reasonably well, but the pro- 
ponent of each theory believes his view to be the more fruitful. We shall 
be better prepared later on to discuss the ways in which theories get 
validated or modified after we are acquainted with them in more detail. 
For the present, we must be prepared to accept the historical truth that 
Opposing theories have great survival value, and that an appeal to the 
facts as a way of choosing between theories is a very complex process, 
not nearly as decisive in practice as we might expect it to be. 

We may begin by examining three kinds of preferences on which 
Stimulus-response theorists tend to differ from cognitive theorists. 


1. “Peripheral” versus “Central” Intermediaries 


Ever since Watson promulgated the theory that thinking was merely 
the carrying out of subvocal speech movements, stimulus-response 
theorists have preferred to find response or movement intermediaries to 
Serve as integrators of behavior sequences. Such movement-produced 
intermediaries can be classified as “peripheral” mechanisms, as contrasted 
with “central” (ideational) intermediaries. The stimulus-response theo- 
rist tends to believe that some sort of chained muscular responses, linked 
Perhaps by fractional anticipatory goal responses, serve to keep a rat 
running to a distant food box. The cognitive theorist, on the other hand, 
More freely infers central brain processes, such as memories or expec- 
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tations, as integrators of goal-seeking behavior. The differences in pref- 
erence survive in this case because both kinds of theorists depend upon 
inferences from observed behavior, and the inferences are not directly 
verified in either case. It is potentially easier to verify tongue movements 
in thinking than it is to discover a revived memory trace in the brain, but 
in fact such verification is not offered with the precision necessary to com- 
pel belief in the theory. Under the circumstances, the choice between the 
peripheral and the central explanation is not forced, and favoring one or 
the other position depends upon more general systematic preferences. 


2. Acquisition of Habits versus Acquisition of Cognitive 
Structures 

The stimulus-response theorist and the cognitive theorist come up 
with different answers to the question, What is learned? The answer 
of the former is “habits”; the answer of the latter is “cognitive structures,” 
The first answer appeals to common sense: we all know that we develop 
smooth-running skills by practicing them; what we learn is responses, 
But the second answer also appeals to common sense: if we locate a candy 
store from one starting point, we can find it from another because we 
“know where it is”; what we learn is facts. A smooth-running skill illus- 
trates a learned habit; knowing alternate routes illustrates cognitive struc- 
ture. If all habits were highly mechanical and stereotyped, variable non- 
habitual behavior would force us to admit cognitive structures as part, at 
least, of what is learned. But the stimulus-response psychologist is satis- 
fied that he can deduce from the laws of habit formation the behavior 
that the cognitive theorist believes supports his interpretation. Hence we 
cannot choose between the theories by coming up with “decisive” illustra- 
tions of what we learn, for both groups of theorists will offer explanations 


of all our examples. The competing theories would not have survived 
thus far had they been unable to offer such explanations, 


3. Trial and Error versus Insight in Problem-Solving 

When confronted with a novel 
solution? The stimulus-res 
bling his habits from th 
ing either according to 
with familiar ones, or 
are similar to situation 


problem, how does the learner reach 
ponse psychologist finds the learner assem- 
e past appropriate to the new problem, respond- 
the elements that the new problem has in common 
according to aspects of the new situation which 
s met before. If these do not lead to solution, the 
learner resorts to trial and error, bringing out of his behavior reper- 
tory one response after another until the problem is solved. The cog- 
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nitive psychologist agrees with much of this description of what the 
learner does, but he adds interpretations not offered by the stimulus- 
response psychologist. He points out, for example, that granting all the 
requisite experience with the parts of a problem, there is no guarantee 
that the learner will be able to bring these past experiences to bear upon 
the solution. He may be able to solve the problem if it is presented in one 
form and not solve it if it is presented in another form, even though 
both forms require the same past experiences for their solution. Ac- 
cording to the cognitive theorist, the preferred method of presentation 
permits a perceptual structuring leading to “insight,” that is, to the 
understanding of the essential relationships involved. The stimulus- 
response psychologist tends, by preference, to look to the past history 
of the learner for the sources of solution, while the cognitive psychologist, 
by preference, looks to the contemporary structuring of the problem. 
His preference for the past does not require the stimulus-response psy- 
chologist to ignore the present structuring of the problem, nor does 
his preference for the present require the cognitive psychologist to ig- 
nore the past, One must not assume because there is a difference in pref- 
erence that either theorist is blind to the totality of the learning situa- 
tion. The facts of the insight experiment are accepted by both theorists, 
as are the facts of skill learning. We may remind ourselves again that 
no single experiment will demolish either the interpretation according 
to trial and error or the interpretation according to insight. 

These three issues—peripheral versus central intermediaries, ac- 
quisition of habits versus acquisition of cognitive structures, and trial and 
error versus insight in problem-solving—give something of the flavor 
of the differences between these two major families of theories. 


Specific Issues Not Confined to the Major Families 

Some issues lie outside the conflict between the stimulus-response 
and the cognitive theories. Thus two stimulus-response psychologists 
may differ as to the role of reinforcement in learning, and two cognitive 
theorists may differ as to the necessity for a physiological explanation of 
learning, Three of these issues will suffice to alert us to the many prob- 


lems that learning theorists face. 


1. Contiguity versus Reinforcement 

The oldest law of association is that ideas experienced together 
tend to become associated. This has come down in one form or another 
to the present day as the principle of association by contiguity, although 
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it is now more fashionable to describe the association as between stimuli 
and responses rather than as between ideas. Several of our contemporary 
theorists accept the principle of contiguous association, notably Guthrie 
(a stimulus-response psychologist) and Tolman (a cognitive psycholo- 
gist). Other theorists insist that’ learning does not take place through 


contiguity alone, unless there is some sort of reinforcement, some equiv- 
alent of reward or punishment. ) 


2. One or More Kinds of Learning? 


The contiguity-reinforcement dilemma may be resolved by accept- 
ing both, thus defining two varieties of learning. This solution has ap- 
pealed to theorists such as Thorndike and Skinner and Mowrer. But 
these two varieties are not the only possibilities. Perhaps by using the 
common name “learning” to cover the acquisition of motor skills, the 
memorization of a poem, the solving of a geometrical puzzle, and the 
understanding of a period in history, we are deceiving ourselves by look- 
ing for common laws explanatory of processes that have little in common, 

Hence the theorist has to choose between a single-factor theory and 


a multi-factor one. Tolman at one time pointed to the possibility of seven 
kinds of learning. 


3. Intervening Variables versus Hypothetical Constructs 8 


We have already considered a contrast between two types of inter- 
mediary, the peripheral and the central types. But as theories become 
more refined, additional problems arise over the wa 
intermediaries should be specified. One kind of intermediary found in 
theories is a mathematical constant that reappears in various contexts, 
such as the acceleration of a free-falling body (g) that appears in equa- 
tions describing the movement of a pendulum, the path of a projectile, 
or the way in which balls roll down inclined planes. Such an integrating 


intervening variable need have no properties other than those expressed 
in its units of measurement, that is, it need 


ence, apart from the functional relationshi 
text. This kind of integrating intermedia 
called an intervening variable. By contr: 
are concrete, tangible, palpable, with 
for example, we describe the behavior that results when a cat is con- 
fronted with a barking dog. The cat arches its back, hisses, its hair stands 
on end, and numerous changes take place within its digestive and circu- 
latory system. Many of the internal changes can be explained by the 
€ MacCorquodale and Mech 


(1954). (1948), Marx (1951b), Ginsberg (1954), 


y in which inferred 


have no independent exist- 
ps it has in its systematic con- 
ry, without surplus meanings, is 
ast, some kinds of intermediaries 
properties of their own. Suppose, 


Maze 
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use of a demonstrable intermediary, adrenin, the hormone of the 
adrenal glands. Suppose that before adrenin was isolated a theorist had 
inferred that some substance in the blood stream was causing the internal 
changes. This would have been a hypothetical construct at this stage, 
an inferred intermediary with palpable qualities. The discovery of 
adrenin would have then confirmed the hypothesis that such a substance 
in the blood stream was, in fact, causing many of the changes. Adrenin, 
as a substance, has other properties than those inferred from bodily 
changes in emotion. In this it differs from a mere intervening variable, 
which has no further properties beyond its systematic ones. 

Those who hold with intervening variables in their learning theories 
are free to choose such variables as they wish, provided they serve their 
systematic purposes of producing a more coherent and parsimonious 
theory than can be produced without them. Those who prefer hypo- 
thetical constructs must seek either demonstrable movements or secre- 
tions (if they are peripheralists ), or some physiological brain processes 
(if they are centralists). Again, the issue over intervening variables or 
hypothetical constructs is not confined to one or the other of the major 
theoretical families. 

One extreme position is that we can do away with intermediaries 
entirely (Skinner). Thus, on this issue as on the others, we have nearly 
all possible views represented. 

This brief introduction to three contrasts between stimulus-response 


theories and cognitive theories, and three issues that are not confined 


to the two major families, should make it clear that what seem to be 


diametrically opposed points of view may turn out to be based on 


differences in preference, each being possible of persuasive statement, 


and to a point justifiable. The opposed cases are each made by intelligent 


men of good will. We shall have to wait until later to consider how a 


more unified outlook may eventually be achieved. 


THE PLAN OF THIS BOOK 


The student of learning, conscientiously trying to understand learn- 


ing phenomena and the laws regulating them, is likely to despair of find- 


ing a secure position if opposing points of view are presented as equally 
plausible, so that the choice between them is made arbitrary. He may 


fall into a vapid eclecticism, with the general formula, “There’s much to 


be said on all sides.” 
This is not a necessary outcome of a serious attempt to understand 


°pposing points of view. Science ought to be systematic, not eclectic, 
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but a premature systematic position is likely to be dogmatic and bigoted 
just as an enduring eclecticism is likely to be superficial and opportunistic. 
It is possible to have systematization of knowledge as the goal without 
permitting the desire for system to blind the seeker after it to the truths 
unearthed by those with views unlike his own. 

Throughout the chapters that follow, in presenting one after the other 
a variety of systematic positions, with illustrative experiments testing 
their assertions, the effort is made to show that there is something to be 
learned from each of them. Each has discovered phenomena which move 
us forward in our knowledge about learning. At the same time, no one 
has succeeded in providing a system invulnerable to criticism. The con- 


struction of a fully satisfactory theory of learning is likely to remain for 
a long time an uncompleted task. 
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Chapter 2 


THORNDIKE’S CONNECTIONISM 


OR nearly half a century one learning theory dominated all others 

in America, despite numerous attacks upon it and the rise of its 
many rivals. It is the theory of Edward L. Thorndike (1874-1949), first 
announced in his Animal intelligence (1898). Its pre-eminence has been 
aptly assessed by Tolman: 
animal learning—not to mention that of child learning— 
arily a matter of agreeing or disagreeing with Thorndike, 
or trying in minor ways to improve upon him, Gestalt psychologists, conditioned- 
reflex psychologists, sign-gestalt psychologists—all of us here in America seem 
to have taken Thorndike, overtly or covertly, as our starting point.t 


The psychology of 
has been and still is prim 


The basis of learning accepted by Thorndike in his earliest writings 
Was association between sense impressions and impulses to action. Such 
an association came to be known as a “bond” or a “connection.” Because 
it is these bonds or connections which become strengthened or weak- 
ened in the making and breaking of habits, Thorndike’s system has some- 
times been called a “bond” psychology or simply “connectionism.” As 
Such it is the original stimulus-response or S-R psychology of learn- 


Ing. 


CONNECTIONISM BEFORE 1930 


There were few changes in Thorndike’s theory between 1898 and 
1930, During these years Thorndike devoted himself largely to applica- 
tions of his established theory to problems of educational and social im- 
Portance. Because of the stability of the concepts during these years, it 
is possible to select any one of Thorndike’s many publications to serve 
as a guide to his theory. The major work, from which most of the quo- 
tations in what follows have been taken, is the three-volume Educational 


* Tolman (1988a), page 11. i 


16 THEORIES OF LEARNING 
psychology (1913-1914), which represents the system at the height of 
i ularity. 
” ss ma characteristic form of learning of both lower animals and 
man is identified by Thorndike as trial-and-error learning, or, as: he 
preferred to call it later, learning by selecting and connecting. The koa 
is confronted by a problem situation in which he has to reach : = 
such as escape from a problem-box or attainment of food. He does this by 
selecting the appropriate response from a number of possible responses 
A trial is defined by the length of time (or number of errors) in- 
volved in a single reaching of the goal. Thorndike’s earliest experiments 
were of this kind, done chiefly with cats, although some experiments with 
dogs, fish, and monkeys were included.? The typical experiment is = =i 
a hungry cat confined in a box with a concealed mechanism ee 2 
a latch. If the cat correctly manipulates the latch, the door opens, = ‘ a 
cat gains access to the food outside. The first trials are a : 3 : 
great amount of clawing, biting, and dashing about before the p p s 
moved. The score, as measured in elapsed time, is high. a -a = 
trials the time scores get lower, but slowly and irregular : PIAS 
gradualness which suggests that the cat does not really i cate Me ae 
manner of escape, but learns it instead by the stamping in of c 
responses and the stamping out of incorrect ones. : 
Experiments like this have become so commonplace that the im- 
portance of their introduction by Thorndike is easily overlooked. Dy 
contrast with the other laboratory arrangements within which learning 
was studied, the problem-box brought to the fore the problems of motivas 
tion, of rewards and of punishments. The typical laboratory experi- 
ments on learning before Thorndike were either the experiments intro- 
duced by Ebbinghaus on the memorization and recall of verbal mate- 
rials or the experiments on acquistion of skill exemplified by the Bryan 
and Harter studies of 


learning telegraphy. In both of these, motivation 
remains in the background as one of the conte. 


learning capacity and other factors not enter 
In his “law of effect” Thorndike brough 
ground. Trials were defined not by a repe 
or by so many minutes of practice (Bryan and Harter) but by the pe 
formance prior to successful (or unsuccessful) goal attainment. 

Thorndike saw that in his law of effect he had 
supplement to the familiar 1 


? Thorndike (1898) (1911). 


xtual features, along with 
ing as manipulated variables, 
t motivation into the fore- 
tition of a list (Ebbinghaus) 


T- 


added an important 
aw of habit formation through repetition: 
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But practice without zeal—with equal comfort at success and failure— 
does not make perfect, and the nervous system grows away from the modes in 
which it is exercised with resulting discomfort. When the law of effect is omitted 
—when habit-formation is reduced to the supposed effect of mere repetition— 
two results are almost certain. By the resulting theory, little in human behavior 
can be explained by the law of habit; and by the resulting practice, unproduc- 
tive or extremely wasteful forms of drill are encouraged." 

The interest in rewards and punishments which grew out of his 
experiments with animals continued naturally enough as he turned his 
attention to learning as it occurs in schools. There the arguments over 
punishment, promotion, school marks, and other incentive devices were 
rife, even though academic psychologists had not yet awakened to the 
centrality of motivational concepts. 

Thorndike’s experiments on animals had a very profound influence 
upon his thinking about human learning. He became convinced, con- 
trary to the then popular beliefs, that animal behavior was little medi- 
ated by ideas. Responses were said to be made directly to the situation 
as sensed, While he did not go so far as to deny ideation among animals, 
he was convinced that the great bulk of their learning could be explained 
by the direct binding of acts to situations, unmediated by ideas. A com- 
parison of the learning curves of human subjects with those of ani- 
mals led him to believe that the same essentially mechanical phenomena 
disclosed by animal learning are the fundamentals of human learning 
also. Although always aware of the greater subtlety and range of human 
ng preference for understanding more complex 


learning, he showed a stro 
and for identifying the simpler forms of 


learning in terms of the simpler, 
human learning with that of animals. 


actice need emphatic and frequent reminders that man’s 
the action of the laws of readiness, exercise, and effect. 
ative mechanism working to avoid what disturbs the 
es. If we begin by fabricating imaginary powers and 
y loose and empty terms, or if we stay lost in 
lity and inventiveness of the higher forms of 
an’s progress or control his education.* 


Both theory and pr 
learning is fundamentally t 
He is first of all an associ 
life-processes of the neuron 
faculties, or if we avoid thought by 
Wonder at the extraordinary versati 
learning, we shall never understand m 


The systematic position is best understood through the three laws 
to which be refers; readiness, exercise, and effect. It is in accordance 
With these laws that animal and human learning takes place. 


3 Thorndike (1913b), page 22- 
* Thorndike (1913b), page 23. 
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The Law of Readiness 


The law of readiness is an accessory principle which describes a 
physiological substratum for the law of effect. It states the circumstances 
under which a learner tends to be satisfied or annoyed, to welcome or to 
reject. There are three such circumstances: * 


1. When a conduction unit is ready to conduct, conduction by it is satis- 
fying, nothing being done to alter its action. 

2. For a conduction unit ready to conduct not to conduct is annoying, and 
provokes whatever response nature provides in connection with that 
annoying lack. 


8. When a conduction unit unready for conduction is forced to conduct, 
conduction by it is annoying. 


particular 


Although “conduction units” were referred to in his earlier writings 
as though he were talking about actual neurones, Thorndike did not, in 
fact, pay much attention to neuroanatomical details. He talked about 
neurones to be clear that he was talking about direct impulses to action, 
and not about “consciousness” or “ideas.” It must be remembered that 
his system antedated behaviorism, even though its emphasis was defi- 
nitely toward an objective account of behavior. The physiological lan- 
guage was the most available vocabulary for the objectivist prior to the 
rise of behaviorism. Actually Thorndike’s “conduction units” have no 
precise physiological meaning. It would be difficult, for example, to 
understand how a physiological unit unready to conduct could be made 
to conduct. 

If for “conduction unit” a term such as “action tendency” is sub- 
stituted, the psychological meaning of Thorndike’s law of readiness be- 
comes clearer. When an action tendency is aroused through preparatory 
adjustments, sets, attitudes, and the like, fulfillment of the tendency in 
action is satisfying, non-fulfillment is annoying. Readiness thus means 
a preparation for action. Thorndike uses the illustration of an animal 
running after its prey, getting ready all the while for jumping upon it and 
seizing it. He describes a child seeing an attractive object at a distance, 
getting ready to approach it, seize it, and manipulate it. He says that it 
is the neurones which prepare prophetically for later actions in the se- 
quence. This somewhat objectionable manner of 
need not detract from the reality of the psych 
satisfaction and frustration depend upon wh 
pared to do. 


describing what goes on 
ological observation that 
at the organism is pre- 


There is another kind of readiness familiar to educators. This is 
5 The wording follows Thorndike (1913a), page 128, 


THORNDIKE’S CONNECTIONISM 19 
illustrated by the use of “reading readiness” to refer to the child’s reach- 
ing a maturity level appropriate to the beginning of reading, Thorn- 
dike did not use his law of readiness in this way, and it would be his- 
torically inaccurate to construe his law of readiness as an anticipation 
of maturational readiness. There is, of course, a logical relationship be- 
tween the two kinds of readiness, because interests and motives mature 
along with capacities. But Thorndike’s readiness was a law of prepara- 
tory adjustment, not a law about growth. 


The Earlier Law of Exercise 

The Jaw of exercise refers to the strengthening of connections with 
practice (Law of Use) and to the weakening of connections or forget- 
ting when practice is discontinued (Law of Disuse). Strengthening is de- 
fined by the increase in probability that the response will be made when 
the situation recurs. This probability may be either a greater probability 
of occurrence if the situation is repeated immediately, or an equal 
probability persisting longer in time. That is, a stronger connection is 
in a favored competitive position relative to other habits, either at the 
time of its strengthening or if tested after there has been opportunity 
for forgetting. 

The definition of strength by probability of occurrence è has a very 
Contemporary ring. It is acceptable in itself to those who might go on 
to reject the structural basis assigned by Thorndike to changes in strength 
of connections. Although changes were said to occur in neurones and 
Synapses, even in his earlier writings the precise properties attributed to 
neurones were cautiously stated and not actually essential to the theory. 

The kinds of phenomena falling under the law of exercise are chiefly 
those of repetitive habits, as in rote memorizing or the acquiring of mus- 
cular skills. Learning curves in which performance is plotted against 
trials represent the quantification of the law of use; forgetting curves 
give quantitative details for the law of disuse. During the period under 
discussion, Thorndike accepted uncritically the prevailing principle of 
learning by doing, even though he had criticized the use of the principle 
independent of the law of effect. He later altered his position and greatly 


reduced the emphasis upon the law of exercise. 


The Earlier Law of Effect 


te The law of effect refers to the strengthening or weakening of a 
connection as a result of its consequences. When a modifiable connection 


° Thorndike (1913b), page £ 
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is made and is accompanied by or followed by a satisfying state of 
affairs, the strength of the connection is increased; if the connection is 
made and followed by an annoying state of affairs, its strength is de- 
creased. 

Two chief objections have been made to the law of effect by its 
critics. At the height of behaviorism it was objected that satisfaction and 
annoyance were subjective terms, inappropriate for use in describing ani- 
mal behavior. But Thorndike was in reality ahead of his critics, for he 


had early stated what he meant by such states of affairs in what would 
today be called operational terms: 


By a satisfying state of affairs is meant one which the animal does nothing 
to avoid, often doing things which maintain or renew it. By an annoying state 
of affairs is meant one which the animal does nothing to preserve, often doing 
things which put an end to it.7 

These definitions are not circular, so far as the law of effect is con- 
cerned. That is, the states of affairs characterized as sati 
annoying are specified independently of their influence upon modifiable 
connections. The law of effect then states what may be expected to 
happen to preceding modifiable connections which are followed by such 
specified states. The objection that Thorndike was lacking in objectivity 
in the statement of the law of effect is not a valid one, 

The second objection was that the backw. 
affairs on something now past in time is not 
gone, effects can be felt only in the present, 
the future. The criticism, like the first 
revealed in the probability of occurrence 
tion next occurs; whether or not such 


sfying and as 


ard effect of a state of 
conceivable. The past is 
or perhaps revealed in 
» is a faulty one. The effect is 
e of the response when the situa- 


> encouraged such objections to the 
atements were indeed objectionable, but the 
objectionable statements never did express the essence of the law of 
effect, which is essentially an empirical matter, 

Translated into more familiar words, Thorndike is saying in this 
law that rewards or successes further the learning of the rewarded be- 
havior, whereas punishments or failures reduce the tendency to repeat 
the behavior leading to punishment, failure, or annoyance. So much 
would merely be a reassertion of common observations. But he went 
further and insisted that the action of consequences is direct and need 

T Thorndike ( 1913b), page 2. 
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not be mediated by ideas. In this insistence his law of effect anticipates 
the reinforcement principle adopted in many conditioned-response 
theories, The later changes in theory reduced the importance of an- 
noyers relative to satisfiers and added some new phenomena, but the 
central importance of a modified law of effect persisted in Thorndike’s 
final statements of his position. 


Subordinate Laws 

aws of readiness, exercise, and effect were said to have 
aws applicable to both animal and human learning.’ 
associative shifting, which is so similar to one 


The major | 
five subsidiary 1 
Among these occurs one, 
variety of conditioned response theory that it deserves special mention. 
account of Thorndike’s views, the impression may be 
given that Thorndike was a very systematic writer. His “system,” apart 
from a few persistent preferences, is in fact a rather loose collection of 
rules and suggestions. What was called a “law” at any one time was a 
statement which at the time apeared to Thorndike to have some general- 
ity of application. No effort was made to retain internal coherence among 
the concepts used, or to establish any genuine relationship of co-ordina- 
tion or subordination among the laws. The five “subordinate laws” to be 
discussed are principles which seemed to Thorndike somewhat less im- 
portant than the major laws of readiness, exercise, and effect. They are 
not related to the major laws in any clear manner, and in later writings 
they were occasionally omitted, occasionally revived. 

1. Multiple response. The first of the five principles is that of multi- 
ple response or varied reaction. In order for a response to be rewarded, 
it must occur. When he faces a problem the learner tries one thing after 
another, When the appropriate behavior is stumbled upon, success fol- 
lows and learning is possible. Were the organism unable to vary its re- 
rect solution might never be elicited. 


sponses, the cor Rank xa He ; 
2. Set or attitude. The second principle is that learning is guided 


e organism. Responses are determined 


In a short 


by a total attitude or “set” of th ae eel Sion 
in part by enduring adjustments characteristic of individuals raised in 


a given culture. But they are also influenced by more momentary ten- 
dencies. The attitude or set determines not only what the person will do, 
but what will satisfy or annoy him. Thorndike says that a more ambitious 
golfer will be annoyed by shots which the more modest would cherish. 
This principle is related to a series of conceptions coming to prominence 


later in discussions of level of aspiration. 


93-31. 


8 Thorndike (1913b), pages 
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is) 


THEORIES OF LEARNING 
3. Prepotency of elements. The third principle states that the learner 
is able to react selectively to prepotent elements in the problem. That is, 
a man can pick out the essential item and base his responses upon it, 
neglecting other adventitious features which might confuse a lower 
animal. This ability to deal with the relevant parts of situations makes 
analytical and insightful learning possible. 

4. Response by analogy. The fourth principle is that of assimilation, 
or response by analogy. How does man react to novel situations? He re- 
sponds to a new situation as he would to some situation like it, or he 
responds to some element in the new situation to which he has a re- 
sponse in his repertory. Responses can always be explained by old ac- 
qnisitions, together with inborn tendencies to respond; there is nothing 
mysterious about responses to novelty. 

5. Associative shifting. The fifth of these subsidiary laws is called 
associative shifting. The fundamental notion is that if a response can be 
kept intact through a series of changes in the stimulating situation, it 
may finally be given to a totally new stimulus. The stimulating situation 
is changed first by addition, then by subtraction, until nothing from the 
original situation remains. Thorndike illustrates by the act of teaching 
a cat to stand up at command. First a bit of fish is dangled before the 
cat while you say, “Stand up.” After enough trials, by proper arrangement, 
the fish may be omitted, and the oral signal will alone evoke the re- 
sponse. The most general statement of the principle of associative shift- 
ing is that we may “get any response of which a learner is capable as- 
sociated with any situation to which he is sensitive.” ° This is obviously 
related to that type of conditioning in which the process is described 
as substituting a conditioned stimulus for an unconditioned one, Thorn- 
dike noted the similarity, but believed the conditioned response to be 
a more specialized case under the broader principle of associative shiftin gl 
While in his earlier writings associative shifting was but the fifth of 
the subordinate laws, in later hooks it was “promoted,” becoming a kind 


of learning second to that by selecting and 
Controlling Learning 
There is always some dan 


writer's influence if attention is 
eralized laws which he proposes, 


connecting.” 


ger of misunderstanding a systematic 
confined to the more abstract and gen- 
to the neglect of some of the accessory 


details which give both flavor and body to his teaching. Thorndike as 


” Thorndike (1913b), page 15, 
10 Thorndike (1985), pages 191-197. 
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early as 1913 was giving much more attention to the dynamics of learn- 
ing than a formal consideration of his laws suggests. 

Within the framework of his primary laws, he saw three considera- 
tions which affected the teacher's problem in using them in the class- 
room."! These were: (1) ease of identification of the bonds to be formed 
or broken; (2) ease of identification of the states of affairs which should 
satisfy or annoy; and (3) ease of application of satisfaction and annoy- 
ance to the identified states of affairs. The teacher and the learner must 
know the characteristics of a good performance in order that practice may 
be appropriately arranged. Errors must be diagnosed so that they will 
not be repeated. When there is lack of clarity about what is being taught 
or learned, practice may be strengthening the wrong connections as well 
as the right ones. At the same time, needed connections may be weak- 
ened by disuse. It is especially hard to teach imagination, force, and 
beauty in literary expression because it is difficult to be specific about 
the conduct which should be made satisfying at the time it occurs. The 
importance of specificity runs throughout Thorndike’s writings. As we 
shall see later, this is at once a source of strength in his system and one 
of its points of vulnerability. 

But Thorndike’s advice is not limited to the application of his major 
laws. He refers also to a number of motivational features not readily 
deducible from the laws of readiness and effect. Five aids to improvement 
he lists as the interest series.’* These he believes to be commonly ac- 


cepted by educators: 


. Interest in the work 

` Interest in improvement 
. Significance 

. Problem-attitude 

5, Attentiveness 


moO 


To these five he added two more which he felt were open to some 


dig jute, The were the absence of irrelevant emotion and the abiens 
of ool a emphasis upon satisfiers and annoyers he is not talking 


about “orude emotional states,” which he believes are to be avoided. 
ill, the balance turns again toward freedom 


i In the case of abe pte a Pa eipey Lorca rare 
tom all the crude emotional state rom al , save 
the tb Origa of success and a firm repudiation of errors which can 
hardly be called exciting- + 


11 Thorndike (1918b), pages 913-217. 
1 Thorndike (1918p), pages 217-220. 


18 Thorndike (1913b), pages 926-227. 
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Thus to his rather harsh and brittle doctrine of specificity of con- 
nections he adds informal considerations which do much to temper it. 
The active role of the learner, who comes to the learning situation with 
needs and problems which determine what will be satisfying to him, is 
recognized implicitly in the commentary on the laws, although it lacks 
explicit statement in the laws themselves. It is probably these accessory 
features which have commanded ardent support by Thorndike’s fol- 
lowers, while it is the more abstract features which have been the focus 
of attack by those who have disagreed with him. 


k The Identical-Elements Theory of Transfer 


Schools are publicly supported in the hope that more general uses 
will be made of what is learned in school. To some extent all schooling 
is aimed at a kind of transfer beyond the school. Whether the proper 
way to achieve this end may turn out to be to te 


ach more formal subject 
matter, like mathematics and the cl 


assics, or to give more attention to 
practical subject matter like manual training and social studies, the 
problem is a central one for educators. 

Thorndike early interested himself in the problem. His theory began 
to take form in an experimental study done in collaboration with Wood- 
worth 1t and was formally stated in his early Educational psychology 
(1903). The theory proposes that transfer depends upon the presence 
of identical elements in the original learning and in the new learning 
which it facilitates, These may be identities of either substance or pro- 
cedure. Thus the ability to speak and write well are important in all 
schoolroom classes and many tasks of ordinary life. Hence mastery of 
these skills will serve in different pursuits, and transfer will be effected 
through what the different situations require in common. The substance 
of what is required in different situations may be unlike, but there may 
be procedures in common. The procedures of looking things up in such 
diverse sources as a dictionary, a cookbook, and a chemist’s handbook 
have much in common, despite the unlike contents of the three kinds of 
book. If an activity is learned more easily because another activity was 
learned first, it is only because the two activities overlap. Learning is al- 
ways specific, never general: when it appears to be general, it is only be- 
cause new situations have much of old situations in them. 

Intelligence as measured by tests may be thought of as to some ex- 
tent a measure of the transfer-capacity of an individual. That is, the test 
measures the ability to give right answers in relatively novel situations. 

14 Thorndike and Woodworth (1901). 
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It is logically sound that Thorndike’s theory of intelligence is, like his 
theory of transfer, a matter of specific connections. The more bonds the 
individual has which can be used, the more intelligent he is. 

Thorndike’s specificity doctrines of transfer and of intelligence have 
been highly influential and have led to a great deal of experimental work. 
Although the problems of the nature of and measurement of intelligence 
lie outside the scope of this volume, there will be occasion later to con- 
sider some of the alternative explanations of transfer. 

During the stable period of Thorndike’s system there were many 
changes in psychological climate, but these left him unruffled. The rise 
of behaviorism and the new importance attributed to the conditioned 


response affected him but little, because the new enthusiasts were talking 
even when they included him in their sweeping 
attacks on everything which preceded them. The attacks by the gestalt 
psychologists in the 20's were more telling, and he began later to meet 
some of their criticisms. But it was his own experiments which led him 
to come before the International Congress of Psychology in New Haven 
in September, 1929, with the statement, “I was wrong.” He there an- 
nounced two fundamental revisions in his laws of exercise and effect 
which became the basis for a number of publications dating from 1930. 


a congenial language, 


CONNECTIONISM AFTER 1930 


visions of his fundamental laws were reported by Thorndike 
in a number of journal articles and monographs with various collabora- 
tors, the main results being gathered in two large volumes under the 
titles The fundamentals of learning (1932a) and The psychology of 
wants, interests, and attitudes (1935). The law of exercise was practically 
renounced as a law of learning, only a trivial amount of strengthening 
of connections being left as a function of mere repetition. The law of 
effect remained only half true, the weakening effects of annoying conse- 
quences being renounced. For the two central laws there was substituted 


half the original law of effect. 


The re 


Disproof of the Law of Exercise 
of experiment used to disprove the law of exercise was 
nt on under circumstances in which the law 


of effect could not be applicable. For example, repeated attempts to draw 
a line exactly 3 inches long while blindfolded did not lead to improve- 
ment, no matter how frequent the repetitions. Practice brings improve- 


The type 


that in which repetition we 
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ment only because it permits other factors to be effective; practice itself 
does nothing. Thorndike intended that his “repeal” of the law of exer- 
cise should be a safeguard against its misuse, not a denial of the im- 
portance of controlled practice. The laws of habit strengthening must 
be those of the conditions under which practice takes place; mere repe- 
tition of situations is not enough. If the person is informed each time 
after his attempt to draw a 3-inch line that his product is too long or too 
short, he will improve with repetition.» 

The law of exercise was not, in fact, fully repealed. It is said that 
Tepetition of situations produces no change in strength of connections, 


but repetition of connections may produce a small advantage for that 


ched to the same situa- 
all practical purposes, 
> not by just occurring. 
ative summaries which 


ut six times as much as 
it would be strengthened by merely occurring.'6 


The Truncated Law of Effect 


A number of experiments yielded data showing that the effects of 


ward appeared to be much more powerful than punishi 
clusion, if confirm 


application as edu 

One of these 
gave the chick the choice of three pathw. 
dom, food, and company’—that is, to 


AT A simple maze 
ays, one of which led to “free- 


“The results of all comparisons by 


iy Trowbridge and ig (1932). There is indeed more regul: 
in his own experiments than Thorndike believed, accordi t y ati is 
data by Seashore and Bavelas (1941). a ie 
16 Thorndike (1940), page 14, 
1 Thorndike (1932b). 
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warding a connection always strengthened it substantially; punishing it 
weakened it little or not at all.” +$ 

The corresponding experiment with human subjects consists of a 
multiple-choice vocabulary test. For example, a Spanish word is given 
with five English words, one of which is its correct translation. A second 
and a third word follow, and so on through a list, each word with alterna- 
tive translations arranged in the same manner. The subject guesses the 
word which is correct, underlines it, and then hears the experimenter 
say Right (“rewarded” response) or Wrong (“punished” response). How 
will he change his responses the next time through the list? As with the 
chicks, reward leads to repetition of the rewarded connection, but punish- 
ment does not lead to a weakening of the punished connection. In six 
experiments of this general sort, Thorndike concluded that the announce- 
ment of Wrong did not weaken connections enough to counterbalance the 
slight increase in strength gained from just occurring.’® 

Thorndike and his staff went on to collect a series of testimonials 
about the relative efficacy of rewards and punishments from published 
biographies and other sources, going back many years. The almost uni- 
versal evidence of the greater beneficial effect of reward than of punish- 
ment gives practical support to the findings of the experiments, which 
otherwise would be criticized as too far removed from ordinary life.?° 

There were some statistical difficulties in Thorndike’s interpre- 
tations of his data which caused him to underestimate the significance 
of punishment. On the whole, however, he probably did a service through 
calling attention to the asymmetry of the effects of reward and of punish- 
ment. 
As in the disproof of the law of exercise, the repeal of the principle of 
Weakening by annoying after-effects is not absolute. It is only direct weak- 
ening which is denied. Punishments do, according to Thorndike, aftect 
learning indirectly. Thorndike says that their indirect effect comes 
chiefly from leading the animal to do something in the Presence of the 
annoyer which makes him less likely to repeat the original connection. 


But this is not necessarily the case. 


An annoyer which is attached to a modifiable connection may cause the 
animal to feel fear or chagrin, jump back, run away, wince, cry, perform the 
same act as before but more vigorously, or whatever else is in his repertory as a 

a ) 


response to that annoyer in that situation. But there is no evidence that it takes 


$ Thorndike oan page a 
Thorndike (1932a), page 289. 
°° Thorndike (1935), pages 185-144; 248-255, 
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away strength from the physiological basis of the connection in any way com- 
parable to the way in which a satisfying after-effect adds strength to it.2* 
Thorndike was less successful in his attempts to explain the action 
of effect than in demonstrating that there are phenomena to which his 
principles apply. He distinguished between a direct confirming influence 
and the informative influence of rewards. Control of behavior according 
to the information supplied by its consequences implies mediation by 
ideas of the sort, “If I do this, I get fed; if I do that, I get slapped.” 
Thorndike believed that he kept this kind of deliberation at a minimum 
in his experiments, so that what he had to explain was the direct con- 
firmatory reaction which he said was responsible for the strengthening 
of responses through reward. This confirming reaction is vaguely de- 
scribed as an “unknown reaction of the neurones” which is aroused by 
the satisfier and strengthens the connection upon which it impinges.*? 
The confirming reaction is said to be independent of sensory pleasures, 
and independent of the intensity of the satisfier. It is highly selective, 
depending upon aroused drives or “overhead control in the brain.” While 
such an account is far from satisfactory, it at least helps to show where 
Thorndike stood. He was against mediation by ideas, as an interpreta- 
tion of effect according to information would imply. At the same time, 
he recognized the complexity of the reinforcement process and was not 
committed to a simple hedonism. The law of effect was for him no longer 
a law of affect, as Hollingworth once named it.2* 


Belongingness 


In addition to the revisions of the laws of exercise and effect, several 
new terms entered as Thorndike’s system was revised. One of these, 
belongingness, by its recognition of an organiz 
to the structure of Thorndike’s theory 
tion, made concessions to the gestalt 
principle, a connection is more e 
the situation, and an after-effect 
tion it strengthens, Thus 
form “John is a butcher. 


ational principle foreign 
of specificity and mechanical ac- 
psychologists.** According to this 
asily learned if the response belongs to 
does better if it belongs to the connec- 
a series of sentences may be read, each of the 
Henry is a carpenter.” The association John- 
butcher is a stronger one following such a reading than the association 
butcher-Henry, even though the latter connection is based on more ne: 
contiguous items. The reason is that a sub 


21 Thorndike (1932a), pages 311-313. 
2 Thorndike ( 1933c) 

2 Hollingworth (1931). 
#1 The point was made carly by Brown 


arly 
ject and predicate belong to- 


and Feder (1934). See also Guthrie (1936b). 
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gether in a way in which the end of one sentence and the beginning of 
another do not. The belongingness of a reward or punishment depends 
upon its appropriateness in satisfying an aroused motive or want in the 
learner, and in its logical or informative relationship to the activities re- 
warded or punished. Thus to be rewarded by having your thirst quenched 
when you lift a glass of cool water to your lips is reward with belong- 
ing. If the same series of movements led sporadically to an electric shock 
on your ankle, that would be punishment without belonging. Although 
Thorndike states that after-effects are influential without either belong- 
ingness or relevance, he points out at the same time that they are more 
effective when they do belong and when they are relevant.** 

While the principle of belongingness may be interpreted as something 
of a concession, the principle of polarity is emphasized as defying gestalt 
principles.** The principle of polarity is that connections act more easily 
in the direction in which they were formed than in the opposite direc- 


tion. If you have learned the items of a German-English vocabulary, it 


is easier to respond to the German word by its English equivalent than 
alent. If a connection is thought 


to the English word by its German equiva 
Je, Thorndike contends, the polarity principle ought not 


of as a new who 1 
easy to dissociate parts from the 


be important. It ought then to be as 
whole in one direction as in another. 

Other new terms introduced, ` ; 
ability,” “availability,” and “mental systems, represent informal exten- 


sions of notions already latent in the earlier writings. 


he Spread of Effect 

In 1933 a new kind of experimental evidence was offered in support 
of the law of effect, evidence described as the spread of effect" The 
experiments purported to show that the influence of a rewarding state 
of affairs acts not only on the connection to which it belongs but on 
adjacent connections both before and after the rewarded connection, the 
effect diminishing with each step that the connection is removed from 
the rewarded one. The effect acts to strengthen even punished connec- 
tions in the neighborhood of the rewarded one. The experiments lent sup- 
port to the automatic and mechanical action of effect. A characteristic 
experiment was that in which the subject was asked to state a number 
from 1 to 10 following the announcement of a stimulus word by the 


such as “impressiveness,” “identifi- 


Discovery of t 


25 Thorndike (1935), pages 52-61. 
26 Thorndike (1932a), page 158. 
27 Thorndike (1933a) ( 1933b). 
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experimenter. The experimenter then called his response Right or Wrong, 
these “rewards” and “punishments” conforming either to a prearranged 
assignment of correct numbers to each word, or to some systematic pat- 
tern of “rights” or “wrongs.” In either case, the assignment of numbers 
from the point of view of the subject is arbitrary, and the cue to re- 
peat the number first assigned or to change it comes from what the ex- 
perimenter says following each word. The lists are so long that the sub- 
ject cannot recall on the second trial just what was done on the first one. 

After the list has been read a number of times in this manner, the 
responses of the subject are classified to find the number of times the 
response was repeated. Not only are the rewarded responses repeated 
more often than the others, but responses called Wrong are repeated be- 
yond chance expectancy if they occur near in time to a response called 
Right. To some extent the phenomena included in the spread of effect 
come nearest to a “discovery” in the whole of Thorndike’s work. Because 
of the novelty of the phenomena and their systematic relevance, typical 
experiments in support of and critical of the spread of effect are re- 
viewed below as illustrative of Thorndike’s influence u 


pon experimen- 
tation in the field of learning. 


EXPERIMENTS ON THE SPREAD OF EFFECT 


The spread of effect is the most characteristic feature of conte 


rary connectionism. It illustrates at once both the rel 
rewards 


mpo- 


ative potency of 
and punishments and the semi-automatic manner in which ef- 


fects act upon connections, whether or not they “belong.” To the extent 
that the phenomena of the spread of effect stand upon firm ground, 
Thorndike’s basic conceptions are buttressed against attacks by his critics, 


Punishment More Effective Than Thorndike Believed 


Thorndike had already changed his mind about the effectiveness of 
annoyers in weakening connections before the spread-of- 
ments were announced. He interpreted his results on the 
fect as ‘confirming his recently acquired belief in the relati 
ness of punishments in weakening connections, The spre 
then, to him, a spread of positive effects, a gradient of r 
is said to strengthen eyen neighboring punished connecti 


In several of his experiments, Thorndike made a fa 
about the baseline of ch 


tive in a series of multip 


effect experi- 
spread of ef- 
ve ineffective- 
ad of effect is 
eward. Reward 
ons. 


ulty assumption 
ance expectation. He assumed that one alterna- 


le possibilities was as likely to occur as another. 
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That is, if there were four choices, by chance each should occur 25 per 
cent of the time. When the opportunity is given to repeat the responses, 
the same one chosen last time should be chosen again in 25 per cent of 
the cases, if the tendency to respond has been neither strengthened nor 
weakened during the first occurrence. In practice this chance repeti- 
tion seldom occurs, for whatever predisposition leads to a preference 
for one of the choices the first time tends to favor that same choice the 
next time. Suppose there are four alternatives, and the series is presented 
twice. If there is a 35 per cent agreement on choices between the first and 
the second time, this does not mean that the gain over chance resulted 
from repetition. The above-chance preference may have been there on 
the first trial. This criticism has been made by Stephens (1934) and Hull 
(1935d), among others, and empirical results show that punishment does 
in fact lead to fewer repetitions. The result of Thorndike’s use of too 


low a base for chance repetition was automatically to assign too much 


strengthening to rewards and too little weakening to punishments. The 


asymmetry could be entirely a statistical artifact. 

Tilton, a former associate of Thorndike, repeated the spread-of-effect 
areful controls to determine what the empirical level 
be without the saying of Right and Wrong, and 
lot the spread of effect on either side of a rewarded 
e.25 He made a correction also for a serial posi- 
tendency to repeat the same response to items 
he end of the list. When correction is made, it 
is found that the effects of Right and Wrong are about alike, the an- 
nouncement of Wrong decreasing repetitions about as much as Right in- 
creases them. A replotting of Tilton’s results is shown in Figures 1 and 2. 

Tilton’s study shows that there is a tendency for punished responses 
in the neighborhood of reward to be repeated more frequently than such 
responses remote from reward. Their punishment (being called Wrong) 
suffices however, even one step from reward, to lead to less repetition 
ihan would 'ocour i£ the response were neither rewarded hor punished 
(Figure 1). Similarly, when a response called Wrong (punished) oc- 
curs in the midst of a series of rewarded responses, the neighboring re- 
warded responses are repeated less frequently than they would have 
been had they not been in the neighborhood of the punished response,’ 
Again, however, their reward (being called Right) is enough to lead 
to their repetition at a greater frequency than that represented by the. 


neutral baseline (Figure 2). 
28 Tilton (1939) (1945). 
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Because Tilton used the conventional method of obtaining spread of 
effect, repeating lists over and over again, and allowing right responses 
and wrong responses to appear with any degree of separation, he had to 
make a rather elaborate analysis in order to exhibit just what was in 
fact happening within his experiment. Martens (1946) introduced a cor- 
rective by arranging the experimental situation so that each position 
before and after reward has an equal opportunity to be represented in 
the final spread of effect pattern. This she did by limiting the training 
and testing to one trial each on each list. That is, on one trial through 
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FIGURE 1. Gradient of effect around a Right response. Calling a response Right 
increases its repetition, calling a response Wrong decreases its repetition; but the 
decrease is less when the Wrong is near to an isolated Right. Redrawn from Tilton 
(1945). 


FIGURE 2. Gradient of effect around a Wrong response, Calling a response 
Wrong decreases its repetition, c: 


j 2 alling a response Right increases its repetition, but 
ie i is less when the Right is near to an isolated Wrong. Redrawn from Tilton 
1945). 


a list of words the subject arbitrarily assigned numbers and was informed 
each time whether the number was right or wrong. The experimenter was 
able to assign the word Right to prearranged positions, all other posi- 
tions being assigned the word Wrong. Then a single additional trial was 
used to determine the extent to which the sub 


ject would repeat the num- 
bers called Right and repeat or change the numbers called Wrong. The 
statistical handling and interpretation of the effects of Right and Wrong 


are greatly simplified, for according to the prearranged pattern there 
are an equal number of opportunities to give the same response or a 
different response to each of several positions before and after the re- 
warded connection, The control situation consisted in the subject’s assign- 
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ing numbers as in a free association experiment, with the list repeated 
twice, no announcements of Right or of Wrong being used.” Her re- 
sults in this modified situation supported Tilton’s, so far as the effect 
of a rewarded response imbedded in the midst of punished responses is 
concerned. That is, the effect of Wrong, even adjacent to the rewarded 
connection is greater than whatever influence can be attributed to the 
spread of the effect of Right. The direct effects of both Right and of 


Wrong are greater than any spread from one to the other. 
Stephens (1941), like Tilton, a former student of Thorndike, has 


made an_ interesting me “symbolic” reward 
strengthens weak connections more than “symbolic” punishment weak- 
ens them, but symbolic punishment weakens strong connections more 
than symbolic reward strengthens them. According to this view, it was 
because Thorndike tended to work mostly with weak connections that 
he got the results he did. Stephens’ experiment was done by using state- 
ments such as appear on attitude scales to determine certain convictions 
held strongly by the subjects, others held less strongly. Then a law-of- 
effect experiment was arranged, in which multiple-choice answers were 
permitted to selected statements, and the subject was informed which 
choice was correct. The method of information was the ingenious one 


of using chemically prepared paper so that the correct response came 
hen it was marked by the instrument provided. This per- 


done with groups. When the experiment was 
Je to see which tendency was the greater, to 
d correctly or to change from the response 
answered incorrectly. The change was found to be correlated with the 
conviction represented by the answer. The better-established replies were 
more influenced by punishment, the less-well-established by reward. 
There is a difficulty of interpretation in this experiment which may 
help to account for uncertainty in other experiments about the relative 
influence of reward and punishment. There is a logical difference be- 
tween responding in the intelligent direction to Right and to Wrong. 
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makes possible immediate rehearsal; the task is clear. The intelligent re- 
sponse to Wrong is to do something different, but what to do is less 
clear. It is necessary both to remember what not to do and to form some 
sort of hypothesis as to what to do. Under time pressure this vagueness 
might well produce an asymmetry between responses following Right 
and Wrong. In the case of poorly established convictions, it may be 
harder to remember what was said that was called Right than in the 
case of well-established convictions to remember what was said that 
was called Wrong. If that were true, Stephens’ results would follow. 
Anything which makes it easier to make the necessary discriminations 


in one case than in another will effect the rel: 


ative potency of hearing 
Right or Wrong, 


and of modifying subsequent behavior accordingly. In 
other words, the situation is not nearly as mechanical and stupid as it 
appears; in fact, the results can be best understood on the assumption 


that the individual is doing his best to act intelligently under somewhat 
confusing circumstances, 


The Theory of Discrete Connections Probabl 


The spread of effect as Thorndike describes it presupposes a series 


of stimulus-response connections with no other organization among them 
than succession in time. V hat i 


y Faulty 


esponse which accompanied 
ve given evidence that additional 
and the automatic spread of the in- 
nections is either incorrect or a gross 
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ther support from the experiments of Jenkins and Sheffield (1946). They 
show that guessing habits are important in the usual spread-of-effect 
experiments. Guessing habits refer to patterns of response, not to iso- 
lated stimulus-response preferences. They found that if a rewarded re- 
sponse was repeated, other responses following it were also repeated; if 
a rewarded response was not repeated, other responses in its neighbor- 
hood were not repeated beyond chance levels. Hence the repetition of a 
rewarded response appears to be more important than the fact of re- 
ward. The major effect of reward is to lead to repetition of the correct 
response; once this success is achieved, the repetition of responses after 


the reward depends upon guessing habits.*° 


Computed random series 
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STEPS BETWEEN FIRST AND SECOND OCCURRENCE OF A DIGIT FROM | TO 6 
FIGURE 3. Distribution of steps from the first to the second occurrence of a digit 
from “one” through “six” in a random series and ina ‘subjectively random’ series 
produced by 15 subjects. Note that the subjects typically avoid using the same digit 
a second time until other digits have been used, From Smith (1949), page 360. 


A related but supplementary suggestion has been made and verified 
by M. H. Smith, Jr. (1949). In what he calls the “probability-bias hy- 
pothesis,” he points out that subjects in arenpung to respond with a 
random series of digits tend to avoid using the same digit over again 
until most of the digits have been used up. In a chance series, the digit 
has an equal chance of coming up on each trial; its appearance on any 
one trial is not influenced by the fact that it may have occurred on the 


ing s hypothesis has been amply supported through additional 
30 The guessing sequence hyp collaborators, and others: e.g., Jenkins and Cunning- 


yee bye seme ied), Fagan and North (1951), Sheffield and Jenkins 


(1952). 
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preceding one. Intervals between two occurrences of the same digit in a 
“random” series are plotted in Figure 3 along with the intervals between 
two occurrences in a “subjectively random” series. The striking differ- 
ence between the curves arises because the subjects, in attempting to 
produce a random series, postpone the recurrence of a digit once used. 
How can this bias against repetition produce a spread of effect? The 
chance of repeating a digit used on the preceding trial following a cor- 
rect (Right) response is enhanced by the avoidance of the digit just 
used as the “correct” response. While this effect is slight, so is the amount 
of the spread of effect as commonly found. A consistent slight bias of 


this kind is enough to produce statistically significant results, 
There are thus seen to be orga 


nizational factors in the spread of 
effect not described simply by remo 


teness of connections from rewards, 


Is “Reward” the Basis for the Enhancement Effect? 

The presumption is made in Thorndike’s ex 
tition of the response called Right results direct] 
may not be the case is shown in the experiments of Wallach and Henle 
(1941, 1942). The subject believed himself to be in an experiment on 
extrasensory perception, so that as he went through a list he w 
to assign the number which had been assigned by someone else for that 
particular trial. Hence a response called Right on one trial might be in- 


correct on the next trial. There was thus no intent to learn the Right 


responses, and in fact the hearing of the word Right did nothing to im- 


‘warded response or its near neighbors. 
is trying to do is more important than 

e's insistence on the automaticity of the 
effect of reward, 


periments that the repe- 
y from reward. That this 


as trying 


and perhaps of its near neigh 


obtained by inserting within a list of black lower 
to be called Right (a) 


highest degree of isolatio 
degree of isolation, and 


31 Zirkle (1946b), 


-case words as the words 
a nonsense syllable in red capital letters, the 


n, (b) a word in black capital letters, the next 
(c) a word printed the same as all the other 


THORNDIKE’S CONNECTIONISM 37 


words, in black lower-case letters, but isolated solely by being called 
Right in the midst of consecutive words called Wrong. The results are 
plotted from Zirkle’s data in Figure 4. His prediction was fully con- 
firmed. Not only was the most isolated response most regularly repeated, 
but its repetition was followed by the enhanced repetition of the words 
immediately following. Because the spread of effect was not demon- 
strated for the words immediately preceding the isolated word, it is 
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only by being called Right in the midst of words called Wrong. Plotted from Zirkle 
(1946p). 


FIGURE 4. Gradient of 


led to the repetition of the isolated word, and 
duced the after-gradient. This conjecture 
e experiments of Jenkins and Postman 
no gradient around an isolated stimulus 


probable that isolation 
that guessing habits then pro 
is indirectly supported by th 
(1948), in which they found 


when guessing sequences were eliminated. 
In his experiment on spread of effect when there was neither reward 
n his ex 


nor opportunity for learning, Sheffield (1949b) found the typical “spread,” 
but the level of repetition was lower than in the usual Thorndike experi- 
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ment. While Thorndike * had found the average per cent of repetition 
for the position following reward to be 20.4, the corresponding per cent 
in Sheffield’s experiment was 13.3. Was there after all some effect of “re- 
ward,” or can the difference in level be accounted for on other grounds? 

Sheffield and Jenkins (1952) set out to determine whether or not 
the reward was responsible for the enhancement in Thorndike’s experi- 
ments. They continued to use Sheffield’s procedure of having subjects 
assign numbers at random, without any statement of Right or Wrong or 
any instructions to attempt to repeat. But the numbers were assigned 
while the subject was engaged in irrelevant learning tasks, approximat- 
ing the kinds of tasks used in Thorndike’s experiments. Depending upon 
the task used, percentages of repetition equal to and greater than those 
found by Thorndike could be produced—all without reward and any 
learning related to the number sequences. A few of the results are sum- 
marized graphically in Figure 5. A change in instructions, and the con- 
cealing of the record of previous replies, raised the level of repetition, 
without any new task being introduced; when an irrelevant learning task 
was used, with a fixed order of stimulus presentation (typical of Thorn- 
dike’s practice), the repetition of other ordered stimuli led to an en- 
hanced repetition of the supposedly random numbers; finally, a paired- 
associates task in which the assigning of a random number followed the 
attempt to recall the response member of the associated pair led to the 
highest level of repetition of numbers on succeeding trials. These experi- 
ments lend further support to the Suessing-sequence hypothesis, and ap- 
pear to make unnecessary the attributing of the spread of effect to a con- 
trol by way of reward. 

One dissent in recent years from the negative findings on the spread 
of effect is that of Duncan (1951). Aware of the many criticisms of the 
spread of effect, he designed an experiment in which the stim ulus-member 
of the associated items would again be important. (It must be remem- 
bered that most of the criticisms center around the fact that the spread 
of effect is generated by response-habits having little to do with the na- 
ture of the stimuli presented.) To this end he used two sets of stimulus 
words. One set consisted of unrelated adjectives (“ancient,” “downcast,” 
“mutual” ) while the other consisted of adjectives very similar in mean- 
ing (“superb,” “choice,” “unique”). Duncan suggested that saying Right 
to a number assigned to one of the similar adjectives might very well 
lead to a repetition of a number assigned to another similar adjective. 


22 Thorndike (1933b). Th 
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In other words, the generalization tendencies might lead to an enl 

ment of the spread of effect, through uncertainty as to the exact ‘douse 
the reward. By the use of the control adjectives of dissimilar eene 
prešumably guessing tendencies and the like would be balanced pe 
His results were largely ambiguous, but the one statistically significant 
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_ FIGURE 5. Modified conditions of experimentation raising the level of repetition 
in a “spread-of-effect” type of experiment without reward or relevant learning. Plotted 


from the data of Sheffield and Jenkins (1952). 


difference he found favored his hypothesis and thus gave some slight 
support to Thorndike’s position. Perhaps the main lesson to be learned 
is that it is very difficult to design the finally decisive experiment, and 
despite all its critics the spread of effect is not yet permanently de- 


molished.** 


The suggestion from Duncan’s experiment that uncertainty as to 

33 There are some unfortunate weaknesses in Duncan’s experiment that prevent its 
making a strong case for keeping the spread of effect alive. For example, he finds more 
of a gradient (more repetition, and more difference between the two adjacent positions) 
following a “rewarded” response not repeated than following a “rewarded” response that 
is repeated. Perhaps his too-similar adjectives have displaced the action of reward 


rather than spread it! 
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what was rewarded may have something to do with the spread of effect 
receives indirect confirmation from experiments by Nuttin (1949) (1953). 
The subject was given a list of 40 well-known stimulus words. Each 
word was followed by five two-syllable words of an “unknown language 
(actually artificially constructed nonsense words), He was asked to 
choose the word that was the exact translation of the stimulus-word, and 
then told whether he was Right or Wrong. The rights and wrongs were 
assigned in a prearranged manner. The list was then repeated a second 
time, but now the subject was asked to repeat the reply he had given the 
first time, adding whether the choice had been called Right or Wrong. 
Nuttin found that wrong items next to ones called Right were remem- 
bered as having been called Right more often than those farther re- 
moved. There is thus some subjective uncertainty as to the locus of the 
experimenter’s assigned Rights and W rongs. ( 
neighborhood of a Wrong 
called Wrong.) 


Correct responses in the 
were also often remembered as having been 


Some Methodolo 


gical Lessons from the Spread-of-Effect 
Experiments 


The spread-of- 
pitfalls confronting 
experiments relev: 
mized Thorndike’ 
logical”) influe 
The experimen 


effect experiments stand as a warning of the many 
the experimenter seeking conscientiously to design 
ant to theory. The spread-of-effect experiment epito- 
s final system: discrete connections, automatic (“bio- 
nce of rewards, ascendancy of reward over punishment. 


effect experiments: 
ut if they will repeat the original 
at they will be convinced.” *4 

earning experiment are complex. The 
e experiments critical of Thorndike has been to 
call attention to a number of these factors that he either overlooked or 


slighted in his description and explanation of the spread phenomenon. 
Some of these are: 
1. The phenomena ar 


e influenced by the set or instructions that de- 
termine what the subject 


is trying to do.* 


84 Thorndike (1949), page 27. 
#8 Wallach and Henle (1941), 
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2. The learning task undertaken may be irrelevant to the spread 
phenomena, yet influence the level of repetition.** 

8. An isolation or emphasis effect is as important as reward in de- 
termining which response is most frequently repeated.’ 

4. Guessing habits result in a gradient of response around the focal 
response that is repeated for any reason whatever." 

5. A probability bias operates so that a subject repeats a digit at 
less than chance frequency when attempting to respond in random 
fashion. This avoidance of one or more previously used digits leads to a 
beyond-chance repetition of the remaining digits.” 

6. Integration of serial acts through anticipatory responses, remote 
associations, etc., produces types of gradient superficially similar to, but 
not causally identifiable with, the spread of effect.*° 

If errors enter because of these six causal factors, they are errors 
in explanation or interpretation. They do not in themselves imply poor 
experimentation, for the factors may themselves be systematically varied 
and their effects studied. 

There are, in addition, errors in the treating of data that led to 
systematic distortions of the results as these results were reported from 
many of the earlier experiments on spread of effect that attempted to do 
just what Thorndike recommended, that is, to repeat his experiments in 
full. Three of these are worth noting: 

1. A faulty baseline of chance was selected, on the assumption that 
empirical chance levels are equal to computed random levels.*! This 
error Jed to an underestimation of the influence of punishment (saying 
WwW rong). 

2, Improper weighting of steps in the gradient by scoring responses 
between two rewarded responses (Rights) as both preceding and follow- 
ing a Right.*? This error led to a greater symmetry in the “before” and 


86 She a kins (1952). 
87 eo bees, he possible role of emphasis was earlier suggested by Tolman, 


Hall, and Bretnall (1932). The isolation effect came to Zirkle’s attention through the 
work of yon Restorff (1933). Strengthened response through isolated punishment, 
noted by Zirkle, has been confirmed by Stone ( 1953). i 

38 Jenkins and Sheffield (1946). The suggestion was earlier made by Tolman (1936), 
in a review of one of Thorndike’s books, but he did not follow it up. 

39 Smith (1949). 5 d of effect i 

40 Jenkins (1943) found data apparently supporting the spread of e ect in a maze 
experiment with rats, but anticipatory and perseverative tendencies make unsuitable an 
interpretation in terms of spread of effect to discrete connections. We shall later meet 
other gradients, such as the gradient of reinforcement, which have only a family 


resembla ad of effect. : 
nt Pointed outlby Stephens (1954), Hull (19854), Tilton (1989). 


42 Pointed out especially by Tilton (1939) (1945). 
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“after” gradients than has been found in later experiments avoiding this 
error. 

8. Repeating lists many times in the same order, thus increasing the 
proportion of Right responses through learning and reducing both the 
prevalence of Wrong responses and their distance from the nearest Right. 
As a consequence the Wrong responses that remain are predominantly 
those which (for any reason) are stereotyped and resistant to change. 
Because late in practice these are necessarily near to Right responses, it 
will appear statistically (but spuriously) that Wrong responses are re- 
peated because of their proximity to Right ones.** 

It is seldom that a single variety of experiment is as crucial to 
a systematic theoretical position as the spread-of-effect experiment was 
to Thorndike’s. For that reason its criticisms have been more pertinent 
than the criticisms of less important experiments. That the criticisms have 
also been more devastating is merely a matter of fact. Had the spread of 
effect been confirmed by repeated experiments under appropriate con- 
ditions, its explanation would have been accepted as an impressive con- 
tribution to psychological theory. 


ESTIMATE OF THORNDIKE’S POSITION 


Before proceeding to an appraisal of the contemporary significance 
of Thorndike’s position, his answers to the standard problems of learning 
will be summarized. By attempting such a summary for each of several 


leading learning theorists it will be easier to keep perspective on their 
similarities and differences, 


Thorndike’s Position on Typical Problems of Learning 


Thorndike’s answers are briefly summarized according to the six 
typical problems discussed in the first chapter. 

1. Capacity. Learning capacity depends upon the number of bonds 
and their availability, The differences between bright and dull are quan- 
titative rather than qualitative, although intelligence has dimensions of 
altitude as well as of breadth.“ The theory of intelligence is consonant 
with the identical-elements theory of transfer, 

2. Practice. Repetition of situations does n 


ot in itself modify con- 
nections. Repetition of connections leads to 


a negligible increase in 


a This criticism, although fairly obvious, was first noted in the earlier edition of this 
book. 


44 Thorndike and others (1927). 
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eth, unless the connections are rewarded. Practice is important be- 


A - ° . 
tase it permits rewards to act upon connections. 


3. Motivation. Reward acts directly on neighboring connections to 
strengthen them; punishment has no corresponding direct weakening ef- 
fect. Punishment may work indirectly, however, through making the 
learner do something else which may confront him with a reward. “Ideas” 
need not intervene; connections may be strengthened directly, without 
awareness. 

4. Understanding. The role of understanding is minimized, not be- 
cause it is undemonstrable, but because it grows out of earlier habits. 
The best way to get understanding is to build a body of connections ap- 
propriate to that understanding. When situations are understood at once 
it is a matter of transfer or assimilation, that is, there are enough elements 
in common with old situations to permit old habits to act acceptably. 

(5, Transfer. The theory of identical elements is espoused. Reaction 
to new situations benefits by the identity of these new situations, in part, 
with old situations, and also by a principle of analogy described as as- 
similation. i 

6. Forgetting. The original law of disuse assumed forgetting to take 
place without practice in accordance with the empirical findings of studies 
such as those of Ebbinghaus. Later books did not deal with the problem 
in any detail; the law of disuse was not mentioned, but some decay with 


no practice was still implied. 


The flavor of Thorndike’s theory was all along that of the automatic 
strengthening of specific connections—directly, without intervening ideas 
or conscious influences. While Thorndike was not an avowed behaviorist, 
and was willing occasionally to use subjective terms, his emphasis was 
certainly behavioral. It would be unfair to leave the discussion of- Thorn- 
dike without referring again to his insistence on measurement, and 
through that insistence his contribution to the improvement of the learning 
of skills in the schools. There was an energetic empiricism about Thorn- 
dike’s experimenting and theorizing which compensated for its lack of 


systematic elegance. 
The Specificity Doctrine a Source Both of Strength 


and of Weakness 

Thorndike gave great impetus to what has sometimes been called 
the scientific movement in education—the movement which suggests that 
educational practices be regulated according to verified outcomes of 
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specific practices. His tremendous drive led to enormous output in fields 
as varied as handwriting scales, dictionary-writing, methods in teach- 
ing arithmetic and spelling, intelligence tests, and vocational guidance. 
But the secret of his output was not only energy: the output stemmed 
also from his matter-of-fact conception of science, that in order to do 
something about anything you have to know specifically what you are 
about. 

The specificity doctrine helps you to roll up your sleeves and get to 
work. Consider, for example, all the complications involved in the teach- 
ing of reading. What is it that the child is to be taught? Philology? Gram- 
mar? Semantics? It took a Thorndike to give the simple answer: “Words.” 
With that answer he proceeded to count the frequency with which each 
word occurs in English, by tabulating millions of printed words from all 
manner of sources. He then arrived at the most common words. These 
are the words which must surely be understood. He made available lists 
and dictionaries to facilitate teaching the most needed words, A speci- 
ficity theory like Thorndike’s tells the educator where to look and how to 
measure in a baffling field such as schoolroom practices, 

The specificity doctrine is also a source of weakness, and it has been 
the target of the most severe attacks upon Thorndike. The illustration 
above shows the kinds of criticisms which Thorndike invites, Is language 
no more than words? Are the most frequent words really what we wish 
to teach? Perhaps we need to think of language as a means of expres- 
sion, as logic in action, and must therefore equip the child with the 
minimum set of tools necessary for adequate communication. That this 
approach is a possible one has been shown by the development of Basic 
English, wherein the central vocabulary of 850 words overlaps only in 
part with Thorndike’s most frequent words.** The approach of Basic 
English takes into account the organized character of language as an 
instrument of meaning. Thorndike, true to association tradition, tended to 
think of language as a collection of words, which he set out to treat 
quantitatively, 

In Thorndike the analytical emphasis of all association theory also 
pervaded the conception of rewards and punishments, and weakened 
somewhat the analysis of these phenomena. The notion that the law of 
effect works mechanically on all connections in the neighborhood of the 

rewarded one makes of reward something extrinsic to the activity in 
question, something pinned on adventitiously. In showing that the law 


48 E.g., Richards (1943). The basic list of 850 words includes some words in the 
seventh thousand of Thorndike’s count, such as advertisement and sneeze, 
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of effect may work this way, he slighted the internal relationships be- 
tween success and what the individual is trying to do, goals which satisfy 
aroused motives or needs. Again, texts may be cited in proof of the fact 
that Thorndike knew all this—he early quoted the “interest series” as 
we have shown and later added the notion of “belongingness”—but that 
does not alter the conclusion that his scientific preoccupations led him 
away from the internal relations of effort and success to the external re- 
lationship of any satisfying state of affairs strengthening any connection 
which happened to be near it. 

Perhaps more heat has been generated over Thorndike’s subordina- 
tion of insight and understanding to drill and habit than over any other 
aspect of his writings. While he thought insight very rare in animals— 
perhaps rarer than it actually is—he did not deny insight in man. He was 
not awed by it, and thought it best understood by the same associative 
laws applying in other situations. Just as erroneous inferences are made 
because of habitual associations which throw the learner off his course, 
so the insights of the genius are made by appropriate habitual associations 
and analogies. He had this to say of reaction to novel situations: 


There is no arbitrary hocus pocus whereby man’s nature acts in an unpre- 
dictable spasm when he is confronted with a new situation. His habits do not 
then retire to some convenient distance while some new and mysterious entities 
direct his behavior. On the contrary, nowhere are the bonds acquired with old 
action than when a new situation appears. 


situations more surely revealed in 
Although this comment is true enough, Thorndike’s failure to give 
real concern to the way in which past habits are utilized in problem solu- 
tion, to consider what arrangement makes a problem hard, what easy, 
: $ : . * a 
. i ` N © O 
when the same essential bonds are inv olved, is a genuine limitation. The 


difference is a real one for school practice. It is possible to teach num- 
establish the “bonds” ), then expect some glimmer 


ber combinations first ( i ; 
or it is possible to achieve some understanding 


of understanding later, HeD 
of what sone are for, to comprehend the situation as a problem, and 


then to learn the combinations in this context. in the end you come out 
at the same place, knowing the tables and knowing how to use them, but 
it is not a foregone conclusion that the one method of teaching will be 
more efficient than the other, either for knowing precisely what has been 
taught or for being able to apply it in new situations. Thorndike’s pie 
Occupation with bonds has insured that we turn to others, not Thorndike’s 
followers, for a more careful appraisal of the role of meaning and under- 


Standing. 
48 Thorndike (1913b), page 29- 
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SUPPLEMENTARY READINGS 


BOOKS 


Thorndike was a prolific writer. His bibliography appears in two parts in 
the Teachers College Record: for the years 1898 to 1940 in volume 41 (1940), 
pages 699-725; for the years 1940 to 1949 in volume 51 (1949), pages 42-45. 
The total comes to more than 500 items. 

The following books contain his major contributions to learning theory, with 
much supporting experimental data: 


THORNDIKE, E. L. (1911) Animal intelligence. 

Tuornpixe, E. L. (1913) The psychology of learning (Educational psychology, 
Vol. II). 

Tuornpikg, E. L. (1922) The psychology of arithmetic. 

THORNDIKE, E. L. and others (1928) Adult learning. 

THORNDIKE, E. L. (1932) The fundamentals of learning. 

THORNDIKE, E. L. (1935) The psychology of wants, interests, and attitudes, 


Tuornpike, E. L. (1949) Selected writings from a connectionist’s psychology. 


SHORTER INTRODUCTIONS 


Gates, A. I. (1942) Connectionism: Present concepts and interpretations, Chap- 
ter 4 in The psychology of learning. Natl. Soc. Stud. Educ., 41st Yearbook, 
Part II, 141-164. 

Rock, R. T., Jr. (1940) Thorndike’s contributions to the psychology of learning. 
Teach. Coll. Rec., 41, 751-761. 

Sanpirorp, P, (1942) Connectionism: Its origins and major features. Chapter 3 
in The psychology of learning. Natl. Soc. Stud. Educ., 41st Yearbook, Part 
II, 97-140. 

THORNDIKE, E. L. (1940) The ABC of human behavior. Chapter 2 in Human na- 
ture and the social order, 5-20. 


CRITICAL REVIEWS 


Brown, J. F., and Fever, D. D, (1934) Thorndike’s theory of learning as gestalt 
psychology. Psychol. Bull., 31, 426-437. 

Cason, H. (1932) Review of Thorndike’s Human learning. J. abnorm. soc. 
Psychol., 27, 214-229, 

HT Q E (1935) Special review of Thorndike’s The 
Psychol. Bull., 32, 807-823. 

McGeocn, J. A, (1933) Review of Thorndike’s 
J. gen. Psychol., 8, 285-296, 


Torman, E. C. (1936) Connectionism; wants, interests, and attitudes. Character 
and Pers., 4, 245-253. 


fundamentals of learning. 


The fundamentals of learning. 


REPRESENTATIVE EXPERIMENTS 


A scientific theory can best be judg 


E c c I ed by its consequences in experimenta- 
tion. This requires an appraisal of the k 


r i inds of experiments to which the theory 
leads and a review of the interpretations offered for the obtained data. A pre- 
ferred theory ought to lead to cogent experiments and to suggest convincing 
interpretations. 

Following this and succeedin: 


l g chapters there will be short lists of experi- 
ments, useful in estimating the the 


ories in this way. Many other references would 
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be equally satisfactory for this purpose. The items marked with an asterisk are 
reproduced in large part in Thorndike (1949). 


Lorce, I. (1936) Irrelevant rewards in animal learning. J. comp. Psychol., 21, 
105-128. 

Postman, L., and Jarrett, R. F. (1952) An experimental analysis of ‘learning 
without awareness.’ Amer. J. Psychol., 65, 244-255. 

Rock, R. T., Jr. (1935) The influence upon learning of the quantitative variation 
of after-effects. Teach. Coll. Contr. Educ., No. 650. 

Stone, G. R. (1953) The effect of negative inventives in serial learning: VII. 
Theory of punishment. J. gen. Psychol., 48, 133-161. 

Tuornpixe, E. L. (1908) The effect of practice in the case of a purely intel- 
lectual function. Amer. J. Psychol., 19, 374-884. 

Tuonnpike, E. L. (1910) Practice in the case of addition. Amer. J. Psychol., 
21, 483-486. i. 

Tuornpikr, E. L. (1924) Mental discipline in high school studies. J. educ. 
Psychol., 15, 1-22; 83-98. ; 

°Tuornpike, E. L. (1927) The influence of primacy. J. exp. Psychol., 10, 18-29. 

°Tuornpike, E. L. (1932) Reward and punishment in animal learning, Comp. 
Psychol. Monogr., 8, No. 39, 65 pp- 


°THornpike, E. L. (1933) An experimental study of rewards. Teach. Coll. 


Contr. Educ., No. 580, 72 pp- i 
Taoro, E. L, and Lorce, I. (1935) The influence of relevance and be- 


longing, J. exp. Psychol., 18, 574-584. 
Tanna = gate Rock, R. T., Jr. (1934) Learning without awareness of 
what is being learned or intent to learn it. J. exp. Psychol., 17, 1-19. 


FINAL NOTE 
For notices of the first edition of this book, disagreeing with my interpreta- 


tion of some aspects of Thorndike’s system, see: 
Stone, G. R. (1948) Hilgard on the dominant laws of learning. Psychol. Rev., 


5, 342-344. , ; . 
ees ea (1949) Review of Hilgard’s Theories of learning. J. abnorm. 


soc. Psychol., 44, 124-129. 
The following reviews bear importantly on the law of effect as a central 
feature of Thorndike’s system: 
Mren, P. E. (1950) On the circularity of the law of effect. Psychol. Bull., 47, 
52-75. s 
Nurri, J. (1953) Tâche, ré 
3 


DENE L. (1947) The history and present status of the Law of Effect. Psychol. 


Bull., 44, 489-563. 


ussite et échec. Especially Chapter V, pages 247- 


Chapter 3 


GUTHRIE’S CONTIGUOUS 
CONDITIONING 


N some respects, Edwin R. Guthrie’s system is very much like Thorn- 
dike’s. It is a stimulus-response association psychology, objective and 
practical. But in other respects the systems are very different. \It is such 
similarities and differences which define the problems of contemporary 
learning theory. 

Thorndike accepted two kinds of learning: selecting and connecting 
(under the law of effect), and associative shifting. For him, associative 
shifting was originally the fifth of some subsidiary principles, and by far 
the major burden was carried by selecting and connecting, For Guthrie, 
on the contrary, a conception very like associative shifting is made the 
cornerstone of his system, and learning of the trial-and-error sort is 
derivative and secondary. Guthrie does not accept the law of effect in 


Thorndike’s sense, and this is the basic cleavage between their systems. 


THE BACKGROUND OF BEHAVIORISM 
AND OF CONDITIONING 


{| Guthrie is a contemporary behaviorist w 
conditioning as the characteristic form of le 
own theories it may prove helpful to cons 
which they have emerged. 


ho lays much stress upon 
arning. Before turning to his 
ider the background against 


Watson’s Behaviorism 


Behaviorism as a “school” of psychology is usually thought of as 
originating with Watson (1913), who became its most vigorous spokes- 
man. There were other varieties of behaviorism, however, and Guthrie 
was led to his position by way of Singer (1911), with whom he had 
studied. ‘The behaviorists have in common the conviction that a science 

48 
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of psychology must be based upon a study of that which is overtly ob- 
servable: physical stimuli, the muscular movements and glandular secre- 
tions which they arouse, and the environmental products which ensue. 
The behaviorists differ somewhat among themselves as to what may be 
inferred in addition to what is measured, but they all exclude self- 
observation (introspection) as a legitimate scientific method. Partly as 
a protection against an indirect use of introspection, the behaviorists have 
tended to prefer experimentation on animals and infants. 

Watson’s Behavior: An introduction to comparative psychology 
(1914) was the first book to follow the announcement of his new posi- 
tion. In it occurred his attempted refutation of Thorndike’s law of effect 
and the substitution of the laws of frequency and recency. He believed 
that animal learning, as in the maze or problem-box, could be explained 
according to what the animal had most often been led to do in the situa- 
tion, with the most recent act favored in recall. Because the successful 
act was both most frequent and most recent, its recurrence could be ex- 
plained without recourse to an added principle of effect. This denial of 
effect was part of his program of getting rid of the residual subjectivity 
which he felt was implied in Thorndike’s concepts of satisfiers and an- 
frequency-recency theory did not survive its criticisms," 


noyers. While the eg : 
it serves to point up Watson’s desire to find objective laws to substitute 


for those with even a tinge of subjective flavor. 
The behaviorist knows that other events intervene between measured 
a $ 
m. In order to preserve a systematically 


stimuli and the responses to ther i 
are posited to be much like 


coherent position, these intervening events ; 
the observed ones, that is, implicit or covert stimulus-response sequences. 
> 


In his early studies on the control of the maze habit, Watson (1907) had 
attributed great importance to kinesthetic stimuli as integrators of the 
habits involved: Because kinesthetic stimuli are aroused as a result of the 
organism’s movements, they fit well into a behavioral or reaction psy- 
chology. Even the unobserved processes inferred to be going on between 
stimuli and responses are said to be comprised oE movements and 
movement-produced stimuli. This emphasis upon kinesthesis as the 
integrator of animal learning served Watson well when he became puz- 
zled about human thought processes. He decided that thought was merely 


implicit speech, that is, talking to 
he conjectured, would detect ton 


accompaniments of thinking. He w 
behaviorist position without denying th 


oneself. Sensitive enough instruments, 
gue movements or other movement 
as thus able to hold to his consistent 
at thinking goes on. 


2 Peterson (1922); Gengerelli (1928). 
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It was somewhat later that Watson discovered that the conditioned 
reflex of Pavlov and Bekhterev might serve as a useful paradigm for 
learning.* Because it grew out of the objective tradition. that had hap- 
pened to develop within Russian physiology, it fitted his temper and 
he adopted it enthusiastically. In Watson's later writings the condi- 
tioned reflex was central to learning, as the unit out of which habits are 
built. 

Watson's general textbook, Psychology from the standpoint of a 
behaviorist, appeared in 1919. It was soon followed by other books writ- 
ten from an avowedly behavioristic standpoint. Among these was Smith 
and Guthrie’s General psychology in terms of behavior (1921). Like 
Watson’s book, it treated all of psychology from a behavioral viewpoint 
and made use of conditioning principles. It, too, laid great stress upon 
movement-produced stimuli. Hence there is a family relationship between 
the two books, although they differ greatly in expository style. Watson 
laid far more stress upon the details of physiology and anatomy, and 
upon appropriate methods for the behavioral study of psychological re- 
lationships. Smith and Guthrie showed less concern for experimental 
and neurophysiological detail, but instead gave a plausible interpretation 
of ordinary experience as described consistently from the new standpoint. ` 
Guthrie's contemporary writings preserve the flavor of the Smith and 
Guthrie book. Despite their similarities, Guthrie’s point of view must be 
considered as something other than a working out of Watson’s position.# 


(The Conditioned Reflex or Conditioned Response \/ 


The conditioned-reflex principle is conceptually so simple as to make 
an attractive prototype for learning Certain stimuli lead regularly to re- 
flex responses. A light flashed into the eye causes pupillary constriction, 
a tap on the patellar tendon leads to a knee jerk, chemicals on the 
tongue produce salivation. Such natural reflexes are called unconditioned 
reflexes and the stimuli which produce them are called unconditioned 
stimuli. If a second stimulus, not originally leading to the response, is 
frequently presented slightly before or accompanying the unconditioned 
stimulus, it presently comes to elicit the response. Such a stimulus is 
called a conditioned stimulus and the learned response is called a con- 
ditioned reflex.)Because of the desire to extend the principle to responses 
not obviously reflex, later writers have usually preferred the less specific 


2 Watson (1916). 
3 It is suggested in Ch 


apter 5 that Hull’s position is another kind of descendant of 
Watson’s. 4 


) 


' 


— 


GUTHRIE'S CONTIGUOUS CONDITIONING 51 
word “response” to the word “reflex,” so that the movements involved 
are called conditioned and unconditioned responses, whether or not they 
conform to the definition of a reflex. 

The study of conditioned salivary responses in dogs was carried out 
Systematically by Pavlov * over many years, and he discovered most of 
the relationships which later studies have more fully explored. The trans- 
lations of his terms have become common in the literature of learning. 

The appearance or strengthening of a conditioned response is said 
to take place through reinforcement. Reinforcement may refer to the ex- 
perimental arrangement of following the conditioned stimulus by the un- 
conditioned stimulus, although the word is sometimes used to refer to 
the implied process of strengthening of the conditioned response. When 
the conditioned stimulus is presented alone, unaccompanied by the un- 
conditioned stimulus, the conditioned response is said to be non- 
reinforced. Under repeated non-reinforcement there is a tendency for 


the conditioned response to weaken or even to disappear completely. This 
rental extinction, or, more simply, extinction. By 
ither the experimental arrangement of non- 
tself. Extinction is not, in fact, 
hen the animal is returned 


is referred to as experin 
extinction may be meant e ; 
reinforcement, or the weakening process 1 
a destruction of the conditioned response, for W as) 
to the laboratory after_a_rest following extinction, the conditioned re- 
Sponse is usually föund tolaye reappeared, though commonly at reduced 
Strength. This return of response strength after extinçtion, without inter- 
Vening reinforcement, is called spontaneous recovery he simple history 
of a conditioned response is then its strengthening eke: reinforce- 
Ment, its weakening through extinction, and its partial recovery with 


Test, 
Wo additional concepts are extremely important in the extension 
; i eralization and discrimination. 


Of conditinn:. eee These are gen 
ndi rinciples. The ici 
tioning princip reinforced, may elicit a con- 


A new iti stimulus, not previously 
f: conditioned w ume k is presented. The probability that it 
t is similar to the conditioned stimulus which 


as been reinforced (Thus if a conditioned stimulus is obtained to one 


Onditi S i will also produce a 
a a slightly different frequency, wil Pp 

> another tone, at a $ oo aH 
Conditioned response. 2 ic lesser magnitude the further the separation 


3 duces a response 
Do by a novel stimulus pro 
© tones [f his process whereby oduc 
a i zation. If the tw: 
“amed to ie stimulus is known as generalization. JI a 


Stimuli are sufficiently distinguishable, the organism can ‘a Pe. ad 
SPond to one of E and to cease responding to the other. Such a dis- 


ditioneq response the firs 
Wi uy an 
ill do so is increased if i 


* Pavlov (1997) (1928). 
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crimination is produced by the method of contrasts. That is, one of the 
stimuli is regularly reinforced, the other regularly non-reinforced. The 
selective extinction which results is known as conditioned discrimination, 
because the organism has learned to react differentially to the two stimuli 
which at first evoked more nearly the same response. Discrimination may 
be between patterns of stimuli as well as between single stimuli. ) 

Pavlov’s paradigm of conditioning has come to be called classical 
conditioning to distinguish it from a variety known as instrumental con- 
ditioning. Instrumental conditioning is epitomized by an experiment to 
be described more fully in the next chapter: lever-pressing by the rat, 
rewarded by a pellet of food. Because pressing the lever is causally re- 
lated to the appearance of food the learned response is an instrumental 
one, that is, it acts upon the environment to produce a given change. The 
two chief differences between classical conditioning and instrumental 
conditioning lie in the relationships between the conditioned response 
and the reinforcing event: 

1. Classes of response. (In classical conditioning, the conditioned 
and unconditional responses fall in the same class, That is, conditioned 
salivation is reinforced by presenting an unconditioned stimulus that 
elicits unconditioned salivation. In instrumental conditioning, the two 
responses fall into different classes: conditioned lever-pressing is rein- 
forced by food, and unconditioned eating responses have little in com- 
mon with lever-pressing.) 

K 2. Contingent relationship between conditioned and unconditioned 
stimuli. In classical conditioning, reinforcement is under the control of 
the experimenter. The unconditioned stimulus (e.g., food) appears after 
a standard time interval whether or not the dog responds to the condi- 
tioned stimulus (e.g., bell). In instrumental conditioning the reinforce- 
ment i controlled by the behaving organism. That is, the rat receives 
the reinforcing stimulus (food) only if it gives the conditioned response 
of pressing the lever. 

Despite these differences in experimental arrangements, the general 
laws of reinforcement, extinction, spontaneous recovery, generalization, 
and discrimination are so similar within classical and instrumental con- 
ditioning that a common terminology has come to be used for both 
varieties, and it is a matter for theoretical discussion whether or not 
separate principles of learning apply to the two classes, 

In choosing to use what has been learned from conditioned-response 


5 A glossary of terms used in conditionin: 


g experiments may be fi in Hilgard and 
Marquis (1940), pages 341-353. he tee ia ia in 
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experiments, Watson and Guthrie selected somewhat different alterna- 
tives. Watson used the Pavlov experiment as the paradigm of learning, 
made of the conditioned reflex the unit of habit, and built his system on 
that foundation. Guthrie, unlike Watson, starts with a principle of con- 
ditioning or of associative learning, a principle which is not dependent 
strictly upon the Pavlov kind of experiment. To Guthrie, Pavlov’s ex- 
periment is a highly artificial one, useful for the relationships it exhibits, 
but itself requiring explanation according to more general principles.) 
For Watson, the classical conditioning experiment provided the answers 
he sought as to how learning takes place; for Guthrie, the classical con- 
ditioning experiment raises as many questions as it answers. } 


CONTIGUITY OF CUE AND RESPONSE: THE ONE 
LAW OF ASSOCIATION ` 


The Lowest Common Denominator of Learning 
(Guthrie’s one law of learning, from which all else about learning is 
is stated by Guthrie as follows: “A combination j 


made comprehensible, 
ompanied a movement will on its recurrence tend 


of stimuli which has acc 


to be followed by that movement.” ° ; 
There is an elegant simplicity about the statement, which avoids 


mention of drives, of successive repetitions, of rewards or of punishments. 
Stimuli and movements in combination: that is all. This one principle 
serves as the basis for a very ingenious and intriguing theory of learning. 
A second statement is needed to complete the basic postulate about 
learning: “A stimulus pattern gains its full associative strength on the 
occasion of its first pairing with a response. : ; l 
This somewhat paradoxical statement, in view of undeniable im- 
provement with practice, is a very necessary adjunct to the theory, be- 
cause it makes possible a number of derivative statements about learn- 


Ta setting. It can be thought of as a kind of recency principle, 
— aik letely in one trial, that which was last done 


for if learning occurs comp 
Š mulus combination will be that which will be done 


bination next recurs.’ ) 


in the presence of a sti 
when the stimulus com 


; : 96: Guthrie (1952), page 23. Where the quotations from 
Sar, (1985), page Tween the 1985 and the 1952 editions of his book, the 
earlier only will be cited pee 
7 Guthrie (1942), page 9%- i her than t ` in ti 
: ? sponse of a succession rather than to recency in time, 
i. a reia te that it be called the principle of postremity. 
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| How can Guthrie demonstrate that more complicated forms of learn- 
ing conform to these simple principles? As in the case of other sophisti- 
cated theorists, he does not proceed by denying familiar forms of learn- 
ing. His problem is that of showing that learning as we know it can, in 
fact, be shown not to contradict these basic principles. He does not deny 
that there is learning which may be described as insightful or purposive 
or problem-solving. It is Guthrie’s task to show that each of these forms 
requires no new Principles of explanation beyond the primary law of as- 
sociation by contiguity, ) 
( Why Strict Contiguity of Measured Stimulus and 
‘Response Is Not Essential) 
One of the standard experiménts in the literature 
that showing the importance of the time interval be 
stimulus and unconditioned response. The empiri 


out for strict simultaneity of cue and response 
in the face of these data by Proposing that the true cue being conditioned 
is not the stimulus as measured. An external stimulus may give rise to 
movements of the organism. These movements in turn produce stimuli, 
When associations appear to be made between stimuli and responses 
Separated in time, it is because these intervening movements fill in the 
gap. The true association is between simultaneous events, ) 


\There is a strong preference for movement-produced stimuli as the 
true conditioners in Guthrie’s system. They permit the integration of 
habits within a wide range of environmental change in stimulation, be- 
cause these stimuli are carried around by the organism, )It appears that 
some of this preference dates from the early emphasis of Watson (1907) 
on kinesthesis as the basi 


g abit, a position no 
a 10 a . : 
longer tenable, Such co produced stimuli provide ever- 
present explanati 


annot be inferred from external 
stimulus-response relationships. 


Pag iG toe fiona determinations of this 
t eynolds a), Kimble 1947), Whi 
Schlosberg oe McAllister (1958); For somewhat Jon E wa me 
10 Honzik (1936) found kinesthesis to be one of the least useful of sever 
controls of the maze habit. 
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Why Repetition Brings Improvement 


The reason that practice brings improvement is that improvement, 
and other forms of success, refer to acts, to outcomes of learnings, rather 
than to movements. Guthrie believes that his interest in movements, and 
the prediction of movements, is almost unique among learning theorists; 
others, he says, are interested in goal achievements, and results of one 
sort or another. One difference between him and Thorndike is that 
Thorndike is concerned with scores on tasks, with items learned, pages 
typed, or correct responses attained. (Guthrie is concerned only with 
the movements of the organism, regardless of whether they lead to 
error or success. 

A skill, such as getting the ball into the basket in a game of basket- 
ball, is not one act but many. It does not depend upon a single muscular 
movement, but upon a number of movements made under a number of 
different circumstances. Any one movement may be learned in any one 
trial, but to learn all the movements demanded by the complicated skill 
calls for practice in all the different situations: while near the basket 
and far away, on one side and on the other, with and without a guard 
nearby, Practice is necessary; but it produces its consequences, not in 
accordance with a law of frequency, but according to the simple princi- 
ple of the attachment of cues to movements. The more varied the move- 
ments called for in a given act of skill, and the more varied the cues 
which must become assimilated to these movements, the more practice is 
required. There is no mystery about the length of time it takes to learn 
to operate a typewriter: there are so many keys in so many combina- 
tions, calling for the attachment of a great many cues to a great many 
responses. It is concomitantly necessary to get rid of the faulty associa- 
tions which lead to what, from a product point of view, is an error. This 
is done by having the correct behavior occur to the cue which previously 
gave rise to the faulty behavior. When finally all the cues iba to ac- 
ceptable behavior, the task is mastered. The apparent contradiction of 


Single-trial learning with actual experience of painstaking fumbling 


before success is achieved is resolved when the skilled task is seen to 


be composed of a large number of habits. 


/ Associative Inhibition, Forgetting, and the Breaking 


of Habits y 
The fact of extinction is one of the findings of conditioning experi- 
ments that is in need of explanation. Because cues should remain faith- 
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ful to their responses, Guthrie cannot agree to extinction as a decay due 
to mere non-reinforced repetition. According to him, extinction always 
occurs as associative inhibition, that is, through the learning of an 


incompatible response. His is an interference theor 


y and*hence requires 
no new principles, because the original learning and the interfering 
learning follow the same rules. | 


| Forgetting is explained in the same way. If there were no inter- 
ference with old learning there would be no forgetting. It has been shown, 
for example, that conditioned responses, even though in some respects 
they appear fragile, are actually quite resistant to forgetting. The long- 
lasting character of these conditioned responses is to be understood be- 
cause they represent learning highly specific to cues not confronted in 
daily life. If the learners lived in the laboratory, their responses would be 
subject to more interferences, Guthrie’s positior 


n is but an extreme form 
of the retroactive inhibition theory of forgetting, to be discussed in 
greater detail later. 


UE it is desired to break up a habit ( 


that is, to accelerate its forget- 
ting), it is only necessary to cause oth 


er movements to occur in the 
roblem of locating the cues and 


s time, because many cues may 
lead to the undesirable habit. 


Drinking or smoking after years of practice are action systems which can 
be started by thousands of reminders. . . . I had once a caller to whom I was 


explaining that the apple I had just finished was a splendid device for avoiding 
a smoke. The caller pointed out that I was at that moment smoking. The habit of 
lighting a cigarette was so attached to the finish of eating that smoking had 
started automatically,12 


| Guthrie su 


gests three ways in which activities 
ened: 13 


are commonly weak- 


1. 
have d 


horse to become so startled that it plunges 
or struggles, 
\ 2. The second method is to repeat the signal until the original re- 


H E.g., Hilgard and Campbell (1936); y ; Ski 
rch rh tee a ( ); Wendt (1987); Skinner (1950). 


1$ Paraphrased from Guthrie (1935), Pages 70-73. 
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sponse is fatigued, and then continuing it, so that new responses are 
learned to the signal. The “broncho-busting” of the western ranches fol- 
lowed essentially this technique. 

8. The third method is to present the stimulus when other fea- 
tures in the situation inhibit the undesirable response. One illustration 
given by Guthrie is that of training a dog not to catch and eat chickens 
by tying a dead chicken about its neck. As it struggles to get rid of the 
corpse it develops an avoidance response to chickens at close quarters. 
Another example, illustrating undesirable learning, is the disobedience 
learned by the child whose mother calls him when he is too occupied with 


what he is doing to obey. 


SOME DERIVATIVE EXPLANATIONS AND 
APPLICATIONS 


In his two books The psychology of learning (1935) (1952) and 
of human conflict (1938), Guthrie finds it unnecessary 
al additions to the basic principles of learning in order 
to apply his theory to practical learning problems and to the handling 
of personality disorders.* While the basic principles remain, there are 
some ingeniously derived supplementations which, once accounted for, 
play an important role in the further discussions: Among these are ex- 
planations of the place of motives, the action of reward and punishment, 
and the origin of anticipatory responses as important substitutes for 


ideas and intentions. 


The psychology 
to make any form 


Motives 


The motivational state of the organism, its hunger, thirst, or state 


of comfort or discomfort, has no formal place in Guthries learning 
theory; the motivational state is important only because it determines 
the” presence and vigor of movements that may enter into associative 
totinečfíion.(The motive is important only for wit it causes the organism 

t occur get associated; if a hungry cat acts 


to do. The movements tha ! 
at, her movements are different and so her 


differ > fr a well-fed c: 
een o t. She learns what she does; what she does is 


learning may be different. 7 A 
more peat than what her motivational state happens to be.) In the 


is collaborative books as well: Smith and Guthrie 
Gen game comments APP Y T46), “Guthrie and Edwards (1949), Guthrie and 
, Guthrie a 


Powers (1950). 
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Guthrie and Horton (1946) experiments the cat often did not eat the 
salmon provided in the dish outside the cage. This did not matter, for 
leaving the cage was the important behavior, 

( Motives are, however, important in one of the derivations from the 
basic theory of contiguous association: they are important in providing 
maintaining stimuli, keeping the organism active until a goal is reached. 
The goal removes these maintaining stimuli, and brings the activity to 
an end.!° We shall see how these maintaining stimuli, along with move- 
ment-produced stimuli, tend to keep a series of acts integrated, and ac- 
count for anticipation, for behavior characterized by intent. 


Reward ` 


| While Guthrie believes as everyone else does that rewards influence 
outcomes, his rejection of the law of effect and of the principle of rein- 
forcement in conditioning is based on the position that nothing new 
is added to associative learning by reward except a mechanical arrange- 
ment. This mechanical arrangement, which places reward at the end 
of a series of acts, removes the organism from the stimuli acting just 
prior to the reward) Hence, being removed from the stimuli, 
to these stimuli is preserved intact. Instead of behavior be 
ened by reward, reward preserves it from disintegration. I 
strong before the reward occurred, but, if there had bee 
behavior in the same situation would have 
ing to the reward, being the last act in the 
one favored when the situation next repeats i 
about this. Of an animal's excape from a pro 


the behavior 
ing strength- 
t was just as 
n no reward, 
been changed. The act lead- 
problematic situation, is the 
tself. Guthrie is very explicit 
blem-box he says: 


The position taken in thi 


s paper is that the an 
first escape. This learning is 


15 Guthrie (1942 
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plicated when one examines the totality of Guthrie’s system. The first 
(and primary) role of reward is to remove the animal from the prob- 
lem and thus prevent unlearning. But by the principle of association the 
animal also learns the activity that he carries on in the presence of the 
reward (chewing and salivating to food, for example ), and this behavior 
tends to be invoked by renewed hunger and by any of the cues from 
the problem that may have persisted while the rewarded behavior was 
going on.» 

. . . There is one act, however, to which hunger may remain a faithful 
conditioner, That is the act of eating; and the faithfulness of hunger to this 
association derives from the fact that hunger dies when eating occurs. As Steven- 


son Smith and I pointed out in our General Psychology, elements of the consum- 
matory response tend to be present throughout a series of actions driven by a 
maintaining stimulus.'* i 

Not only do general movements of eating tend to be aroused by hunger 
contractions, but the specific movements demanded by the peculiar nature of 
the food are possibly in evidence. Hence when the rat runs the maze he is ready 
for whatever reward has been received in the past, sunflower seed or bran mash. 


This readiness is an actual muscular readiness. . . «13 

A theoretical problem arises when Guthrie holds at once (a) that 
the reward removes the learner from all the cues prior to the reward, 
and (b) that these prior cues are somehow reward-attached. He could 
resolve this dilemma in one of two ways. He could retreat from the all- 
or-nothing removal of cues by reward: perhaps reward acts by changing 
the situation somewhat, even though not entirely. Or he could define 
the “true” reward as coming at a stage later than eating, perhaps consum- 
mation through food digestion. The animal is still hungry while eating, 
hence hunger produces anticipatory eating responses. Then, however, 
the act of eating does not remove the animal from the cues to which he 
has been exposed, because he has been exposed us eating-response cues 
all along. What finally changes the situation and brings the end of hunger 
(and presumably other cues) is satiation. Although this sounds like 
something of a quibble, the distinction between the ear of contigu- 
ous association and the theory of reinforcement is so slight that only the 
utmost clarity of statement will permit a choice between them. 

One illustration may serve to show how important it is to know 
exactly what the theory states. The experiment of Seiti and Roby 
(1950) shows that the non-nutritive taste of saccharin may serve as a 
reward. The result is used as a refutation of the need-satiation theory 
of reinforcement and as support for the theory of contiguous associa- 


17 Guthrie (1935), pages 151- 
18 Guthrie (1935), page 173- 


152. 


60 THEORIES OF LEARNING 


tion (as well as some other theories). The empirical support for Guthrie 
derives from the fact that the rat’s behavior changes strikingly after the 
saccharin is ingested. The authors conclude: 

It is suggested that elicitation of the consum 


a more critical primary reinforcing 
reduction subsequently achieved. 


matory response appears to be 
factor in instrumental learning than the drive 


While this consummatory response is the final one in 
has not yet taken the learner out of the situation, It must itself be part 
of the series, otherwise there could be no anticipatory responses reflect- 
ing it. What really takes the animal out of the situation? What is really 
reinforcing? Is it the new behavior that takes place after the consumma- 
tory response is over? But if it is this new behavior, does that not occur 
after some sort of stimulus-reduction has taken place? 

Guthrie's general standpoint is clear enough: reward does not 
strengthen prior behavior, it merely protects it from unlearning. But the 
more precise questions about the relation of this position to his inter- 
pretation of anticipatory response remain unanswered, ) 


a series, it 


| Punishment \ 


The primary interpretation of punishment is that of 
learning: you tend to do what you did under the s 


. + + Sitting on tacks does not disco 
learning to do something else than sit. It is not the feelin 


but the specific action caused by punishment that determines what will be 

learned. To train a dog to jump through a hoop, the effectiveness of punishment 

depends on where it is applied, front or rear. It is what the punishment makes the 

dog do that counts or what it makes a man do, not what it makes him feel.20 
What we can predict is th 


, t at the influence of stimuli acting at the time of 
either satisfaction or annoyance will be to re-establish w 


i hatever behavior was in 
evidence at the time.?1 


all associative 
ame circumstances: 


urage learning. It encourages one in 


g caused by punishment, 


If one were to sto 


p with statements such 
that(Guthrie treats re 


à avior would remain intact, 
being protected from new learning by the altered conditions of punish- 
ment as much as by the altered conditions of reward) Sometimes this 
does actually appear to be the case. Running responses leading to charged 
grills, with safety beyond, may prove resistant to extinction.?? 

19 Sheffield and Roby (1950), page 481 

2 Guthrie (1935), page 158, | 8° 481 
$ Guthrie (1935), page 154. 
2? Gwinn (1949), F. D. Sheffield (1949a). 
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The symmetry in treatment of reward and punishment, as implied 
in the foregoing quotations, is somewhat illusory, for “doing what you 
last did” refers to very different parts of the behavior cycle when the 
reference is to reward than when the reference is to punishment. “What 
you last did” that remains in your behavior repertory because of reward 
is what you did just before the reward appeared; “what you last did” in 
the case of punishment refers to what you did just after the punishment, 
leading to escape. Punishment produces “maintaining stimuli” relieved 


by a later movement that brings relief: 


a charged grid, a barefoot boy on a hot pavement, a man 
sitting on a tack have as their goals mere escape from the intense stimulation 
that causes general tension and restlessness as well as specific movement. These 
stimuli continue to act as what Stevenson Smith and I called maintaining stimuli 
until some movement carries the subject away from the source of stimulation, or 


the source of stimulation away from the subject.** 

When these stimuli are removed, we have the circumstances defin- 
ing reward, and so, if that were the whole story, the prior behavior should 
remain intact. This kind of relief from punishment has come to be known 
as escape learning to distinguish between it and avoidance learning. 
To move from escape learning to avoidance learning we require an antici- 
patory response, conditioned to some cue, So that the punishment is 
circumvented. As we saw in relation to reward, Guthrie’s system makes 
provision for anticipatory responses, and these can, of course, be made 
ance learning. The animal merely makes the 
escape response to some cue present at the time of punishment—a cue 
which, fortunately, makes its appearance before the threatened punish- 
ment. Hence what happens at the end of a sequence of acts leading to 
punishment does something in addition to and other than removing the 
organism from the scene: it also sets up some conditioned anticipatory 


. . . An animal on 


use of to explain avoid 


responses. i 
Noxious stimuli may lead not only to escape and avoidance learn- 


ing but to a third subvariety, more properly called punishment, that is, 
to the inhibiting of otherwise rewarded behavior. We may try to stop 
the child’s doing something he enjoys; we may try to break a “bad 


habit.” Punishment of this kind always involves conflict: 
(Guthrie’s position with respect to this subvariety of punishment has 


been aptly summarized by Sheffield: 
if the last response to the punished situation 


(a) Punisheient warkay hat brought on the punishment. 


is incompatible with the response t 


23 Guthrie (1935), page 165. 
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(b) Punishment works only if the cues present when the incompatible re- 
sponse is performed are present when the punished response is performed. 
(c) Punishment that produces only emotional excitement will tend to 


fixate the punished response. 

The main point is that punishment wins out in a conflict with be- 
havior that is otherwise motivated only if there is conflict. The state- 
ment about the presence of cues is not as decisive as it sounds, in view 
of Guthrie’s acceptance of anticipatory responses and movement-pro- 
duced stimuli as cues. In other words, there are always plenty of cues, 
if one needs them. The third point is a useful one: In some cases, what 
appears to be punishing to the trainer may not be punishing to the learner; 


in other cases, punishment, even though annoying, may accentuate 
stereotyped behavior. 


\ Intentions y 


( Conduct is organized into se 


quences in which people make plans 
and carry them out, or 


at least start to carry them out. Guthrie is aware 
of thisyand devotes a chapter to learning with and without intention.?* 


He and Smith had earlier followed the lead of Sherrington 
Woodworth in considering sequences of behavior as composed of 
current or preparatory responses and consummator 
acts appear from the outside to be intentional, for th 
clearly are in readiness for the consequences that 
anticipatory responses or readiness reactions 
to maintaining stimuli, 

The typical case is that of the 


and 
pre- 
y responses.** Such 
e earlier adjustments 
are to follow. These 
are said to be conditioned 


hungry rat running down an alleyway 
to food at the end. The activity is maintained by the internal stimuli 


aroused by food deprivation to which running and eating behavior have 
been conditioned in the past. That is, the rat found food at son 


time after running while hungry. These internal stimuli, plus 
from the runway (if it h 


the running of the animal 


ne previous 
the stimuli 
as been previously a path to food), maintain 


against competing responses, such 


ivation or chewin 
character to the behavior. Al] 


is food at the end of the maze. 


as stopping 


tails are not taken 
“4 F. D. Sheffield (1949), 
a Guthrie (1935), Pages 202-211. 
#8 Smith and Guthrie (1921); Sherrington (1906); Woodworth (1918) 
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as seriously as Hull, for example, takes them. But the paradigm pro- 
vides a way of talking about human intentions and purposes also. 


The essence of an intention is a body of maintaining stimuli which may or 
may not include sources of unrest like thirst or hunger but always includes ac- 
tion tendencies conditioned during a past experience—a readiness to speak, a 
readiness to go, a readiness to read, and in each case a readiness not only for the 
act but also for the previously rehearsed consequences of the act. These readi- 
nesses are not complete acts but they consist in tensions of the muscles that will 


take part in the complete act." 

This statement goes a long way toward the point of view which those 
with very different theories of learning accept. The only feature which 
thin the bounds of Guthrie's theory is that all the readinesses, 
including the readiness for the “previously rehearsed consequences of 
the act,” are said to consist in tensions in the muscles.\This assumption, 
characteristic of the behaviorist position, remains in the realm of con- 


jecture rather than of demonstration. 


keeps it w 


The Control of the Learning Process 

It is part of the charm of Guthrie’s writing that it is closely in touch 
with life and provides amusing but cogent suggestions for meeting the 
problems of animal training, child-rearing, and pedagogy. This practical- 
ity is not a necessary characteristic of the system, for if one seriously 
attempted to provide evidence for the theory he would be buried in 
the midst of the precise movement correlates of measurable stimuli and 
the muscular tension accompaniments of preparatory adjustments. But 
the system is not intended to be taken seriously in that sense. As long 
as a convenient way of talking about things can be found without seem- 
ing to contradict the system, quantitative precision is not essential. It is 
Guthrie’s conviction that scientific laws to be useful must be approxi- 
mately true, but they must also be stated coarsely enough to be teach- 


able to freshmen.“ 

Most of the practica 
and he succeeds in makin 
ing example: 


] advice which Guthrie gives is good advice, 
g it flow from the theory. Consider the follow- 


: -year-old girl complained to a psychologist that for two 
va m ee ap E by a habit of tossing coat and hat on the 
7 rs her daught d the house. On a hundred occasions the mother had insisted 
T bs pre the clothing and hang it in its place. These wild ways were 

at the girl pick up began to insist not that the girl pick 


mother, on advice, 
ae a pik from the floor, but that she put them on, return to the 


27 Guthrie (1935), pages 205-206. 
28 Guthrie (1936a). 
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street, and re-enter the house, this time removing the coat and hanging it 
properly.?° 

Why was this advice given? Behavior is in response to stimuli. 
Hanging up the coat and hat was in response to her mother’s pleading 
and the sight of the clothing on the floor. In order to attach the desired 
behavior to its proper cues, it was necessary to go outside and come into 
the house, so that entering the house became the cue for h 
coat and hat. 


The following statements represent the kind of suggestions which 
recur in his writings: 

1. If you wish to encourage a particular kind of behavior or dis- 
courage another, discover the cues leading to the beh 
In the one case, arrange the situation so that the desire 
when those cues are present; in the other case, arrange it so that the 
undesired behavior does not occur in the presence of the cues. This is 
all that is involved in the skillful use of reward and punishment. A stu- 


dent does not learn what was in a lecture or a book. He learns only what 
the lecture or book caused him to do,% 


2. Use as many stimulus su 
because any ordinary behavior i 
of stimuli. The more stimuli 


anging up the 


avior in question. 
d behavior occurs 


responds. Another way of putti 
in the precise form later to be demanded of us, 


EXPERIMENTS ON THE PUZZLE-BOX 


One of the serious lacks 


gible ground on which t 
strengths and the we 


It is fitting that 
acteristic experiment 


© come to grips with both the 
aknesses of Guthrie’s Position. 


Guthrie and Horton should h 


ave chosen as a char- 
the behavior of the cat in es 


caping from a puzzle- 
29 Guthrie (1935), page 21, 

#0 Guthrie (1942), page 55. 

$1 Guthrie and Horton (1946). 
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box, because this situation had already been the occasion for both ex- 
periment and theory. Thorndike’s classical experiment has already been 
cited (page 16). This gave the send-off to Thorndike’s theory by con- 
vincing him that little of the cats behavior was mediated by ideas and 
much of it controlled by the influence of rewards. Adams’ (1929) repe- 
tition of Thorndike’s work gave much more evidence for ideas. These 
two experiments represent extremes of theory in addition to providing 
detailed descriptions of behavior as background for the Guthrie and 


Horton experiment. 


Adams’ Repetition and Supplementation of Thorndike’s 

Work 

Adams (1929) repeated Thorndike’s experiment as faithfully as 
possible, noting details of the cat’s behavior instead of relying solely on 
time curves. He also designed some other experiments, of which the 
following is typical. : 

A piece of liver is suspended from the top of a wire cage, so that the 
liver rests on the floor inside the cage, loosely held by the thread. A 
hungry cat in the room with the cage, but outside it, sees the liver and 
walks over to the cage. It hesitates for a time and its head moves up 
ugh it is studying the string. Then it jumps on top of 
string in its mouth, raises the liver by joint use of 
eaps down with the stick at the end of the string 
in its mouth, The liver is pulled to the top of the cage, but it is unfortu- 
nately torn off. In another trial the cat is successful in obtaining the 
liver by pawing the stick, taking it in its mouth and jumping down again. 
This time the liver has been more securely fastened, and the response is 


and down as tho 
the cage, catches the 
mouth and paw, and | 


je argues that this kind of behavior is very different from the 
random slashing about described by Thorndike and appears by analogy 
to be ann iS ideas. If the cat has used its past behavior and habits, 


it has used them in a new situation in a remarkably appropriate manner. 


( The Guthrie and Horton Experiment 
i d details of movement rather than 


nae cor 
; their wish to re s 
p; Because of n some other manner, a special problem-box was 


he cat to be fully observed during the period 
i $ d its exact posture to be recorded photographically at 

aa to solution Es ted the release. The release mechanism was a small 

es mans pe of the floor ofa glass-covered cage with exit door 
e set in the mi 


to score achievement i 
designed which permitted t 
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in the front. The animal entered through a starting box and tunnel at 
the rear. If it touched the pole in any manner at all, the front door 
was opened and the animal could escape into the room. A camera was 
operated as the door was opening, so that a photograph of the animal was 
obtained at the moment of release, while it was still in contact with the 
pole. 

In each of three preliminary trials the animal entered the box 
through the tunnel and made its way out the front door, which was left 
ajar, to find a bit of salmon on the table top in front of the box. The 
first of the regular trials followed. During the regular trials, the experi- 
menters kept notes of the animal’s behavior as it entered the bo: 
the tunnel at the rear, and the camera record: 
sition when it struck the release mechanism, 

The results are remarkable for the amount of repetitiousness in each 
cat's successive release. A cat which bites the pole may do so time after 
time; one which has escaped by backing into the pole may back al: 
endlessly in its efforts to escape by the same movement, 
paws or hind paws, or roll against the pole. The cat, in full agreement 
with the theory, learns the method of escape in the first trial and then 
repeats what is essentially the same solution time after time. Some cats 
have several modes of escape which they use at different times, or 


they have one type of escape for a long time-and then shift to another, 
These exceptions to the princi 


are accounted for on the basis of 
stimulating conditions; the resul 
been in the box 
method, some 
fact that the ] 


x through 
ed the exact time and po- 


most 
Others use front 


having 
the release by a familiar 
d the familiar one. The 


iple that such an act will 
& can interfere with it, the cat leaving the 
as he strikes the release mechanism, The fact that the 


cat’s often failing to eat 


ure 6, 
acknowledged. It is 
n his theory.\That is 


This behavior is so cony; 
coherent with all th 
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surely as much as could be wished for by a theorist from a series of 
experiments. 

Why did these cats learn so much more easily than Thorndike’s? 
The answer given is plausible. These cats always found the release mech- 
anism available in exactly the same form, and readily operable. Thorn- 
dike’s release mechanism was more difficult to operate and was probably 
not always in precisely the same position. Hence Thorndike’s cats had to 


i Kat had 
et bie e 


K29  K30 : K-33 K-34 


e ed responses of a single cat escaping from the 

Oe VAN f: Alternative sie Responses K-9 through K-12 illustrate one type of 

porz e box ia ekl by cat K. The remaining pairs of pictures show four other 

ce pe reat by e È cat. Reproduced from tracings of photographs taken 
sciatica! ly at the time of release. From Guthrie and Horton (1946). 


learn a series of habits rather than a single habit. When Guthrie and 
Horton’s cats failed to operate the mechanism by a familiar method 
they. au adopted a new method. Stereotypy was shown because stereo- 


pene Horton are clear that they are nO proposiag a test of 
the cat’s intelligence. They could easily have devised an ie geal in 
which there would have been much less stereotypy. But pe point is 
that as far as the learning of movements is concerned, the animal tends 


to do what it last did in the situation. If the situation forces it to do some- 
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. STEPS TOWARD FORMALIZATION AND ADDITIONAL 
EXPERIMENTAL TESTING 


Since the first edition of this book appeared, some of Guthrie’s 
students have undertaken the task of clarifying and formalizing his 
theory, and of designing new experiments crucially related to it. The 
theory has gained support also from other quarters, and has now become 
a serious contender among contemporary learning theories, as it was there 
suggested that, with such help, it might become. 

Guthrie’s statements have been collected and cast into postulational 
form by Voeks (1950). She goes on to state eight theorems open to experi- 
mental test, and has herself provided some confirming experimental re- 
sults. Her four basic postulates are here quoted verbatim: 


Postulate 1: Principle of Association 


(a) Any stimulus-pattern which once accompanies a response, and/or im- 
mediately precedes it (by 14 seconds or less), becomes a full-strength direct 
cue for that response. (b) This is the only way in which stimulus-patterns not 
now cues for a particular response can become direct cues for that response, 


Postulate 2: Principle of Postremity 


(a) A stimulus which has accompanied or immediately preceded two or 
more incompatible responses is a conditioned stimulus for only the last response 
made while that stimulus was present. (b) This is the only way in which a 


stimulus now a cue for a particular response can cease being a cue for that 
response. 


Postulate 3: Principle of Response Probability 38 


The probability of any particular response’s occurring (P) at some specified 
time is an increasing monotonic function (x) of the proportion (N) of the stimuli 
present which are at that time cues for that response, (P = N*), 


Postulate 4: Principle of Dynamic Situations 


The stimulus-pattern of a situation is not st: 
modified, due to such changes as result from the s 
cumulation of fatigue products, visceral ch 


the subject, introduction of controlled or 
present. 


atic but from time to time is 
ubject’s making a response, ac- 
anges and other internal processes of 
uncontrolled variations in the stimuli 


These four postulates (and the theorems related to 
isolated responses as the essential core of Guthrie’s theory. Unfortunately, 
Voeks’s theorem system has not yet been extended to deal systematically 
with the puzzling problems posed by various arrangements of reward and 


them) deal with 


36 Some of the subscripts have been omitted from the s ls i 
necessary here, ymbols in Postulate 3 as un- 
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punishment, with anticipatory responses, with the integration of acts 
through movement-produced stimuli. Nevertheless, the start made by 
Voeks is a very useful one, and her experimental tests are cogent. 

Her first experimental test ° (completed before the system was 
formalized, but consistent with the formalization) deals with the pre- 
diction of behavior in a maze. She studied the individual responses at 
man subjects learning a raised relief finger 


each choice-point of 57 hu 
maze and a punch-board maze. She tested which of two predictions was 


the more accurate: (1) prediction based on the frequency of prior 
choices at the point of choice, and (2) prediction based on postremity 
(i.e., the last choice that was made). Postremity won out easily. The 
success of prediction by postremity was most striking when the prediction 
p JP y 5 j: 
disagreed with that on the basis of frequency. For 56 of the 57 subjects, 
postremity predicted better than frequency for those choices in which 
the predictions disagreed. The difference was statistically highly signifi- 
cant, These results are, of course, in agreement with Guthrie’s theory, 
as stated in Voeks’s Postulate 2. 

The major puzzle is why the maze was learned at all. Why was not 
the last trial chiefly a repetition of the first, with some minor fluctuations 
according to Postulate 4? The explanation is not very difficult for the 
high-relief maze, for the last response was always leaving the choice- 
point by the true path. The explanation is more difficult for the punch- 
board maze, because 4 “njon-correction” method was used, That is, if 
the “wrong” hole was punched, this was signalled to the subject, but he 
went on to the next choice without punching the “right” hole. Let us 
look at Voeks's explanation of the learning of the punch-board maze: 

‘ Now, in the learning of the punch-board maze, S may insert his 
stylus into the wrong hole of a pair, e.g., pair 8. A stimulus then is presented 
(the sound of the buzzer in our experiment, E saying wrong in others, ete.) 
which may cause S to withdraw his stylus and make additional responses on that 
trial to that choice point. These responses may involve, for instance, the S’s say- 
ing to himself “Eight, not this hole, that one is right,” or even making incipient 
or possibly overt movements toward the other correct hole while looking at the 
at If this has been the case; the next time S comes to the pair of holes, he will 
say “Eight” los instructed), and it is expected he will start toward the incorrect 
eae ie his stylus back, say “No, not this hole; that one is right,” and then 
make the previously established conditioned response to the stimulus “That-is- 

5 se of inserting his stylus into the hole at which he has just 


a ii on 
right,” i.e., the respo t” Thus another error may be eliminated. This 


looked and said “That is right’ T™ 3 
again is in accordance with the principle of postremity.?" 


37 Voeks (1948). y 
38 Voeks (1948), page 505. 


especially 
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In fairness to the experimenter it must be pointed out that this 
change of response would have been recorded as a failure of prediction 
by postremity, although on the basis of her theoretical analysis it was 
no failure at all. Hence her successful predictions by postremity are 
on the conservative side. The explanation offered for the actual learn- 
ing (the elimination of an error) seems to come very near to an ex- 
planation according to knowledge of results or effect, and gives some- 
what specialized meaning to the concept of “doing what you last 
did.” 

The second experiment? studies conditioned eyelid responses of 
human subjects, measuring both the occurrence of the responses and 
their amplitudes. The question asked is this: “Is a stimulus-response 
connection gradually strengthened by reinforcement, or is a stimulus- 
response connection established suddenly, in all-or-none fashion?” Hull 
is said to favor gradual strengthening, Guthrie all-or-none appearance of 
responses, according to Voeks’s postulates 1 and 2. 

Two tests were made of the theory of gradual strengthening, the 
first studying amplitude, the second frequency. Beginning with the first 
conditioned response (CR), succeeding responses for each subject were 
divided into fourths. For only 6 of 32 subjects was there a progressive in- 
crease in amplitude of CR from quarter to quarter. In general, increases 
from quarter to quarter were not statistically significant, although for 
25 of the 82 subjects the last CR was larger than the first (a significant 
difference), Amplitude changes thus gave little support to Hull's theory 
of gradual increase in strength with repeated reinforcement, The fre- 
quency results were more strikingly in Guthrie’s favor, bec 
was acquired in all-or-none fashion. No subject showed 
frequency of CR from quarter to qu 
the subjects gave CR’s on every trial 
only a few lapses for the others, 

How do Voeks’s re 
published for CR’s? It i 


ause a response 
an increase in 
arter following his first CR. Half 
after their first one, and there were 


sults square with the many acquisition curves 
s well known that curves for groups of subjects 
show characteristics very different from curves for individual subjects, 
Voeks has very convincingly shown that response acquisition may be 
all-or-none, and yet a group learning curve will show a gradual slope, 
The accompanying learning curve (Figure 7) results, for example, when 
the probability of response is plotted as a group function for 15 sub- 
jects all of whom had jumpwise curves, that is, all of whom responded 


consistently with CR’s after the first CR was made. The form of the curve 
39 Voeks (1954). 
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is determined solely by the trial on which the first CR happened to ap- 
pear for different subjects.*° 

Voeks predicted jumpwise curves of learning under the very uni- 
form conditions of stimulation that she arranged to produce, and her 
predictions were well borne out. A jumpwise curve is, of course, ideal for 
a test of the postremity principle: if there is an unbroken run of no 
responses, the prediction by postremity is perfect during the course of 
this run; if this is followed by an unbroken run of conditioned responses, 
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effect of combining individual learning curves. The group 
he curves of 15 subjects all of whom had jumpwise 
followed by consistent responding. After Voeks 


FIGURE 7. Misleading í 
curve results from combining t 
curves, i.e a run of no responses 


(1954), page 145. 
prediction by postremity is perfect then, too. The prediction fails for only 
one trial (that on which the conditioning occurs! ). By the test of pos- 
tremity prediction, Guthrie’s theory was beautifully confirmed. Of the 
total predictions made, 84.6 per cent were correct. 7 
What are we to do with the fact that an unconditioned response of 
the eyelid did occur (within % second of the conditioned stimulus) on 
every trial from the beginning? Does not Voeks’s Postulate 1 lead us to 
expect a conditioned response right away? Why does the CR wait for 
several trials (as many as 10) before it makes its appearance? Perhaps 
there is something to the group curve after all, showing that the proba- 
bility of giving the first CR increases with the number of reinforcements. 
P T 7 Joeks’s i i ; tei 
ct Nota naval conn re en publ for sing Teme dogs, 


A individual curves of eyelid conditioning (the response Voeks 
iy an meet Eag pags Coons Hg, and Wend (188) page O% 
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Enough of the current studies favor Guthrie’s theory to insure that 
it will be given serious attention as an active contemporary theory of 
learning. 


ESTIMATE OF GUTHRIE’S POSITION 


Guthrie’s Position on Typical Problems of Learning 


By way of comparative summary, how Guthrie stands on the several 
representative problems of learning will be briefly stated. 

1. Capacity. Problems of capacity are not formally treated, although 
species differences are recognized and allowance is made for maturation 
as a determiner of many classes of acts.‘ Presumably any response which 
the organism can make can become associated with any stimulus to which 
he is sensitive—a generalization about the possibility of learning which is 
reminiscent of what Thorndike says about associative shifting. If pressed, 
Guthrie could find a basis for differences in capacity both in the differ- 
entiation of movement and in the discrimination among proprioceptive 
cues. All animals are not equally versatile and equally equipped with 
receptors. 

2. Practice. Practice assimilates and alienates cues, until a whole 
family of stimulus combinations comes to evoke a whole family of re- 
sponses which lead to the outcome socially described as successful per- 
formance. Because skill represents a population of habits, learning ap- 
pears to accumulate with repetition, although basically each individual 
habit is learned at full strength in a single repetition. 

8. Motivation. Motivation affects learning indirectly through what 
it causes the animal to do. Reward is a secondary or derivative principle, 
not a primary one as in Thorndike’s system. Reward works because it 
removes the animal from the stimulating situation in which the “correct” 


response has been made. It does not strengthen the “correct” response, 
but prevents its weakening becau 


tached to the cues which led to 
relative strengthening, 

Punishment does 
fects for learning are 
according to the pri 


response in the situ: 
cases: 


se no new response can become at- 
the correct response. Thus there is a 
because responses to other cues get alienated. 

at once several different things, In general, its ef- 
determined by what it causes the organism to do, 
neiple that the best predictor of learning is the 
ation that last occurred. We may distinguish four 


44 Guthrie (1935), pages 18, 38. 
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a. Mild punishment may be merely exciting, and enhance ongoing 
behavior without disrupting it. 

b. More intense punishment may break up a prior habit by leading 
to incompatible behavior in the presence of the cues for the earlier habit. 

c. Continuing punishment acts like a drive, producing maintaining 
stimuli that keep the organism active until it finds relief. Then the conse- 
quence for learning is really that of reward: the act that leads to safety 
is rewarding because it terminates the punishment and by removing the 
maintaining stimuli protects from unlearning the activities carried on in 
the presence of those stimuli. 

d. Stimuli that have previously accompanied the punishment pro- 
duce behavior that formerly occurred following the punishment itself. 
Here we have an illustration of anticipatory response, essential to avoid- 
ance behavior. The cues to avoidance must earlier have been present 
at the time of punishment for this anticipation to occur. 

4. Understanding. Concepts like “insight” are handled in a derisive 
is recognized that learning with foresight of its 
cur. The tendency is to talk down such learning, 
and to emphasize the mechanical and 


repetitious nature of most human as well as animal learning. Such learn- 
ing with intention and foresight as does occur is explained on the basis 
of conditioned anticipatory or readiness reactions, based upon past ex- 
perience and hence not contradicting association principles. 

5. Transfer. Learning transfers to new situations because of com- 
mon elements within the old and new. In this the position is rather like 
Thorndike’s. Stress is laid, however, on the identity being carried by way 
of common responses evoked, the proprioceptive stimuli being sufficiently 
similar from responses to a variety of stimuli to evoke common condi- 
tioned responses. The emphasis upon movement-produced stimuli thus 
represents Guthrie’s supplementation to Thorndike. 

Because of his principle of responses being conditioned to all ad- 
ventitious contiguous stimuli, Guthrie expects rather little transfer and 


is, in fact, rather extreme about it. The only way to be sure to get desired 
" is to practice in that new situation as well. 


behavior in a new situation 1 
To be able to perform in a variety of situations, you have to practice in 


a variety of situations. 

6. Forgetting. Le 
with by new learning. Hence 
responses which replace the 


manner, although it 
consequences may 0c 
however, just as Thorndike did, 


arning is said to be permanent unless interfered 
all forgetting is due to the learning of new 
old responses. It may take place gradually 
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for the same reason that skills may be acquired gradually: remembering 
depends upon many habits to many cues, and subhabits may drop out 
gradually as subcues become attached to new responses. 

By contrast with Thorndike, Guthrie is an avowed behaviorist who 
makes it a matter of some importance to get rid of subjective terms, to 
refer, for example, to inner speech instead of to thinking. The emphasis 
upon movement-produced stimuli is part of this older behaviorist tradi- 
tion which Guthrie carries into the present. While an orthodox behavior- 


ist in these respects, his is an informal behaviorism, with little of the 
brittleness of earlier Watsonianism. 


Invulnerability of the Theory a Cause for Skepticism 


The uncertainty that exists in practically all learning experimenta- 
tion makes the fact-minded psychologist suspicious of a finished system 
at this stage of our knowledge. While scientific truth must eventually 
have exceptionless validity—if its laws are truly lawful—the history of 
our most advanced sciences shows that their theories move by successive 
approximations, and the most advanced theories do not emerge full blown 
from the head of the theorist. Even as loose a system as Thorndike’s went 
through revisions on the basis of evidence regarding the effects of pun- 
ishment; Hull's system, the most carefully worked out of any, was con- 
tinuously being revamped to meet experimental fact. One of the sources 
of uneasiness about Guthrie’s system lies in its assured answers to the 
problems of learning—answers that have remained unchanged through a 
quarter of a century of the most active psychological experimentation 
we have ever known, Experimental controversies finally get resolved as 
we learn more about the independent variables that modify the measured 
consequences. No matter how these issues get resolved, Guthrie’s system 


remains unchanged. Either the theory is a miraculously inspired one or 
it is not stated very precisely, and hence is not very sensitive to experi- 
mental data. 

Criticism of Guthrie’s position is rendered somewhat difficult by the 
nature of the task which he sets himself. It is not qu 


he believes his system to have any responsibility 
The principle of associatio: 


` l s n or conditioning is not an explanation of any 
instance of behavior. It is merel 1 


i r. ] y a tool by which explanation is furthered. A tool 
is not true or false; it is useful or useless.15 


The paradox of the theory lies in the kind of sophistic 
in this statement, combined with a casualness which baffle 


45 Guthrie (1935), page 232, 


ite clear whether 
with respect to details. 


ation implied 
s the critic, It 
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is not unfair to ask of the position that it substantiate its claims, which 
are, indeed, far-reaching and are competitive with the claims of others. 

Of the opposing points of view to which he objects, Guthrie gener- 
ously admits that their phenomena and the terms used to describe them 
are correct and useful for certain purposes.*° What he says in essence is 
that what they do is all right for limited purposes, but it is not very help- 
ful so far as understanding ordinary learning is concerned. All other 
writers are said to be concerned with outcomes, in the form of success or 
goal achievement. Under their theories, he says, the teacher must be a 
mere passive element in the situation and cannot be told how to influence 
the outcome.‘ This non sequitur indicates a failure to take opposing 
claims seriously. Insight, if not predicted on the basis of past experience, 
must be in the category of luck, and hence lie outside of science.** While 
many of Cuthrie’s observations are astute, a cavalier handling of serious 
alternatives to his own gives an impression of immutability inappropriate 


in a growing science. 


The Simplicity of the Theory May Be Illusory 

uch of the fascination of Guthrie’s theory rests upon his 
a wide range of phenomena from the single principle of 
ciation. Parsimonious scientific theories are at- 
Itimate in reductionist theories. Although Voeks 
s, with proper definitions of 


Certainly m 
ability to deduce 
One-trial contiguous asso 
tractive, and this is the u 
found it convenient to state four postulate š 
the stimuli and responses entering into association, she would have found 


one to be enough, with the other three as: corollaries: 

A painstaking search through Guthrie s writings for careful defini- 
tions of stimulus and response, for distinctions between observables and 
constructs, for statements taking the form of predictions and those tak- 
ing the form of a posteriori explanations, has led recent critics to con- 


clude: 
inci itioni ich he expands seem to have a 
Nhi neiples of conditioning whick pands a 
E cae i Praid be desirable in a theoretical formulation of behavior, a 
closer be lala reveals that a formidable set of additional assumptions and con- 
structs are required if his theory is to possess any real applicability to experi- 
mental data.*" 

EN Tolman, Hull, Skinner and others, Guthrie (1952), 
fee Sar ace hed Mec new to the revised edition of the book, although the 
Eon is unchanged from views expressed much earlier, e.g., Guthrie (1942), 
page 57. 

3T Guthrie (1942), page 57. 


48 Guthrie (1935), page 25. 
49 Mueller and Schoenfeld (1954), page 377. 
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It is undoubtedly true that many reviews of Guthrie in the literature have 
mistaken incompleteness for simplicity.°° 


Guthrie’s Contributions 


Despite his formal concern with the more hard-boiled aspects of 
scientific observation and scientific logic, Guthrie is without a peer in 
the use of anecdote and illustration to make pertinent comments about 
the activities of everyday life, including the symptoms found in the 
psychological clinic. This complicated material is talked about in dramati- 
cally simple terms, and his theory makes this kind of talk possible. The 
difference between plausibility and proof is one of the differences that 
hounds psychological science, and a theory that is not sensitive to experi- 
mental data will not be sensitive either to those clinical patients who do 
not get well following good advice. Nevertheless, there is something to 
be learned from Guthrie's type of psychologizing, and it is not a sufficient 
explanation to remark that he is a wise and experienced person, a shrewd 
observer of human beings. His kind of psychologizing is appealing enough 
to have led many promising young men and women to enter upon careers 
in psychology—a contribution to the field not to be overlooked. 

At the experimental level, Guthrie’s greatest contribution is to call 
attention to the large element of repetitiveness and stereotypy in behavior 
when the opportunities are favorable to such monotonous behavior. The 
tendency toward smugness is easily overlooked, especially by those who 
choose to study the changes in behavior called learning. 

There are enough well-trained younger psychologists friendly to 
Guthrie’s ideas to undertake the task of increasing the rigor of his sys- 
tem beyond that yet attained. The 


promise of the system is great enough 
to make the effort worth while. 
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view of the logic of science an 
than the present chapter: 


Murren, C. G., Jr., and ScH 
W. K. Estes and others. M 


critical review of Guthrie’s contributions from the point of 
d system-making goes into greater analytical detail 


OENFELD, W. N. (1954) Edwin R. Guthrie. In 
fodern learning theory, 345-379. 


Chapter 4 


SKINNER’S OPERANT 
CONDITIONING 


i he a series of papers beginning in 1930, B. F./Skinner proposed a formu- 
lation of behavior which arose out of observations of animal perform- 
ance in a type of experiment that he invented: the bar-pressing activity 
of a rat in a specially designed box called (by others) the Skinner box.) 
The experiments and theories were first brought together in book form in 
“ his Behavior of organisms (1938). Experimentation has continued, much 
of it using a new organism (the pigeon) and a new equivalent of bar- 
pressing (the pigeon’s pecking at a spot). The prineiples have become 
the basis for two textbooks of general psychology, Keller and Schoen- 
feld’s Principles of psychology (1950) and (Skinner's Soience and human 
behavior (1953). As an avowed behaviorism making use of condition- 
ing principles, the system can be understood as a development of the 
ground broken by Thorndike, just as Guthrie’s system can be so under- 
stood. But if Guthrie's theory is thought of as a development of Thorn- 
dike’s associative shifting! Skinner's operant conditioning is a development 
more along the lines of _Thorndike’s learning by selecting and connecting 
under the law of effect. As we shall see, Skinner acknowledges two kinds 
of learning, just as Thorndike did, but, 1 
emphasis upon th 
consequences, 


ike Thorndike, he places more 
at kind of learning which is under the control of its 


| RESPONDENT AND OPERANT BEHAVIOR) 


The greatest break with conventional stimulus-res 
within Skinner’s system is the distinction between res 
ant behavior:) Conventional stimulus-response psychology enforced the 
dictum “no stimulus, no response” by assuming the presence of stimuli 
when a response occurred though no stimuli were identifiable. Although 

: 82 


ponse psychology 


pondent and oper- 
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it was often convenient to talk about “random” or “spontaneous” re- 
sponses, it was not doubted that stimuli were present to elicit them, if 
the experimenter only had means of detecting them. Skinner finds this 
method of forcing facts both undesirable and unnecessary. He proposes 
that two classes of response be distinguished, a class of elicited responses 


and a class of emitted responses. ` 


( Respondent and Operant Distinguished 
\ Responses which are elicited by known stimuli are classified as 
respondents. Pupillary constriction to light and the knee jerk to a blow 
on the patellar tendon serve as convenient illustrations. There is a sec- 
ond class of responses which need not be correlated with any known 
stimuli. These emitted responses are designated operants, to distinguish 
them from respondents. )\While the conventional treatment of such re- 
sponses is to consider them as respondents with unknown stimuli, Skinner 
holds the conviction that the stimulus conditions, if any, are irrelevant 
to the understanding of operant behavior. Because(operant_behavior is 


not elicited by recognized stimuli, its strength cannot be measured ac- 


cording to the usual laws of the reflex, which are all stated as functions 


; , r i se is used as n measure of strength. \ 
ory nd rate of response is used as ; ae 
i prr and ni does, acquire a relatian to pelar ie 


lation. In that caie it hecomes a diseriminuted operant; the stimulus Dbe- 
comes an occasion for the operant behavior, biit fs not ati eliciting stimi 


lus as in the case of a true reflex) Skinner formerly extended the term 
“reflex” to cover the operant, although this proved somewhat awkward 
because several of the laws of the reflex do not applyCA simple illustration 
of an operant co-ordinated with a stimulus would be a reaction-time 
experiment as commonly conducted in the psychological laboratory. The 
tortelation between stimulus and response may easily be changed; y by 
instructions to depress the key instead of lifting the finger from it) The 
ponse to changes in stimulus intensity is 


relationship of latency of resp i ne 
very TM for a discriminated operant from what it is for an elicited 


response, say the lid reflex to sound—a respondent. Such a comparison 
between a discriminated operant end a respondent has 1 a by 
Peak (1933). Although the characteristics of voluntary and're r it 
responses both were changed in response to a in laua ness, 
the magnitudes of changes differed tor bepo 3 $ : ike r fio 

\ Most human behavior is operant in character. he behavior of eat- 
N r og a letter, shows but little of respondent 
ing a meal, driving a car, syne p Ral anae arant behavior is 
character, The emphasis which Skinner places upon operan havior is 
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appropriate if he is going to have something to say that applies in prin- 
ciple to the commonest forms of human (and animal) behavior. 


| Two Types of Conditioning 
‘Related to the two types of response there are said to be two types 
of conditioning. 

The conditioning of respondent behavior is assigned to Type S, be- 
cause reinforcement is correlated with stimuli. The conditioned stimu- 
lus (e.g., a tone) is presented together with the unconditioned stimulus 
(e.g., food) and thus comes to elicit the response (e.g., salivation). The 
reinforcing event that interests Skinner is the presentation of the un- 
conditioned stimulus, not the response to it.Pavlov’s classical condi- 
tioning experiment is said to be of Type S. Its two laws are the Law of 
Conditioning of Type S, and the Law of Extinction of Type S.1\The 
law of conditioning makes such conditioning depend upon the approxi- 
mate simultaneity of stimuli. The evidence for the existence of Type S 
conditioning is actually quite slim, because much that by experimental 
arrangement conforms to classical conditioning is better understood as 
a consequence of factors other than contiguity of stimulus and response. 
Hull, as we shall see, denied the existence of conditioning of Type S; 
Skinner admits that it does not appear experimentally in pure form.? 
But(Skinner does not attribute much importance to Type S Jin any case. 
Whether or not there is such conditioning does not matter too much to 
his system. 

y Type R he believes to be much the more important. (This is the con- 


ditioning of operant behavior, and the letter R is used to call attention 
to the important term in the correlation 1 with reinforcement. In this case 
it is a response which is correlated with reinforcement. ‘The experimental 
example which he originally used was lever-pressing, This response may 
be strengthened by following it with food. It is not the sight of the lever 
which is important; it is the pressing of the lever: )The conditioned re- 
sponse does not resemble the response to the reinforcing stimulus 
relationship to the reinforcing stimulus is that it causes it to appear. In 
operant conditioning, conditioning of Type R, reinforcement cannot fol- 
low unless the conditioned response appears; reinforcement is contingent 
upon response. The two laws of Type R are not unlike those of Type S, 
including a law of conditioning and a law of extinction“The law of con- 
ditioning of Type R may be compared to Thorndike’s law of effect: If 


; its 


1 Skinner (1938), pages 18-19. 
2 Skinner (1938), page 238. 
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the occurrence of an operant is followed by presentation of a reinforcing 
stimulus, the strength is increased.* Note that the reinforcing situation is 
defined by its stimulus; nothing is said about satisfying after-effects or 
about drive reduction. What gets strengthened is not a stimulus-response 
connection, because the operant requires no stimulus; this statement is 
unlike Thorndike: 

This dependence upon the posterior reinforcing stimulus gives the term 


operant its significance. . . . The operant . . . becomes significant for be- 
havior and takes on an identifiable form when it acts upon the environment in 


such a way that a reinforcing stimulus is produced.‘ 

The mechanical arrangement under which Type R conditioning is 
usually demonstrated is that suggested by the quotation, a situation in 
which the response of the organism produces the reinforcing agent. This 
is what has been called instrumental conditioning to distinguish it from 
the arrangements of classical conditioning." l 

One suggestion offered quite tentatively by Skinner © is that condi- 
tioning of Type S may be limited to autonomic responses, Type R to 
skeletal behavior. The crucial question is whether Type S occurs at all; 
if it does, it may well be limited to autonomic responses. That Type R 
occurs is evident, and most skeletal responses, including those obtained 
under the arrangements of classical conditioning, can easily be shown 
to conform to the pattern of discriminated operants. Actual experiments 
in which autonomic conditioning takes place (salivation, galvanic re- 
sponse) are full of indirect accompaniments of Type R. When the cir- 
cumstances seem almost ideal for demonstrating Type S conditioning, as 
in attempts to condition pupillary constriction by presenting a tone 
along with a light, it is extremely difficult to obtain any conditioning at 


all.? 
REINFORCEMENT OF AN OPERANT 


quantitative relationships within operant con- 


In order to get at the ù : 
i = d a special apparatus suitable for use with 


ditioning, Skinner designe 


3 Skinner (1938 } page = 

4Skinner (1938), page = 

5 Hilgard and Marquis (1940), pa 
Ferster (1953). 9 ia 

6 Skinner (1938), page +44; Eeoa ë b t Hi 

7 llary conditioning have been most unsuccesstul, egn 
ay Recent attempts Pr (1840), Hilgard, Dutton, and Helmick (1949), Young 
(1954 i ra a ccess has been reported, however, by Kotake and Mihama (1951) 
and ee ap a Kotake (1954), so the matter is still open. “Pure cases of Type S 
a wll oe es find. The heart-rate conditioning of Notterman, Schoenfeld, 


-onditioni hard a ‘ ae 
ie 1952 ) includes the possibility of operant intermediaries. 


ges 51-74. For Skinner’s laboratory methods, see 
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white rats. It consists essentially of a darkened sound-resisting box in 
which the rat is placed. There is a small brass lever within the compart- 
ment which, if pressed, delivers a pellet of food. The lever is connected 
with a recording system which produces a graphical tracing of the num- 
ber of lever pressings plotted against the length of time that the rat is 
in the box. By a carefully controlled handling of the animals, remarkably 
consistent and “lawful” results can be obtained. Modifications of the ex- 
periment can be introduced so that food is not delivered every time the 
lever is depressed. The consequences of doing this and of making other 
changes in the situation have been systematically reported. 

The evident consequence of the reinforcement of an operant is to 
increase the rate with which the operant response is emitted. This in- 
crease in rate of responding is an indicator of the increased probability 
of response, which is thé appropriate measure of operant strength, \sub- 
stituting for such measures as amplitude and latency, more appropriate 
for measuring the strength of a respondent. While the expression reflex 
strength was originally used for both respondent and operant behavior,’ 
in Skinner’s later writings he tends to restrict the word “reflex” to its 
ordinary meaning of elicited reflexes (salivation, knee-jerk), and to con- 
ditioned reflexes based on them, that is, to respondent behavior.® 


Ê Extinction as a Measure of Operant Strength P 


{When an operant is regularly reinforced, the rate 
interrupted by the activity of eating. Because the time- 
eating does not occur during extinction, responses withi 
better than responses during conditioning as me 
quences of reinforcement. fwo measures of responses within extinction 
are commonly used: rate of responding, and the total number of re- 
sponses before responding returns to its normal rate prior to conditioning, 
The total number of responses during extinction now often described as 
resistance to extinction was formerly called the reflex reserve, a figure 
of speech to describe a kind of reservoir of responses ready to be emitted 
during extinction. Skinner no longer believes the concept of a reflex re- 
serve to be very useful, although he appears to reject it because of his 
later interpretation of appropriate scientific concepts rather th: 
of any change in the factual relationships described.2? 


of responding is 
consuming act of 
n extinction serve 
asures of the conse- 


an because 


8 Skinner (1938), page 20. 
® Skinner (1938), pages 45-59.. 
1 Skinner (1938), page 26. 


H Skinner (1950), page 203. The original concept w: 


as not without its critic. 
ae fet Jut its critics, egs 
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x (A single reinforcement suffices to produce a number of conditioned 
operant responses in extinction. (After receiving a single pellet of food 
following one pressing of the bar, a rat may respond 50 or more times, 
yielding a typical extinction curve. Additional reinforcements add slowly 
to the number of responses during extinction. An extinction curve follow- 
ing a single reinforcement and one following 250 reinforcements are re- 
produced in Figure 8. The curves are cumulative ones, not to be con- 
fused with learning curves as usually plotted. As the curve levels off 
it means that responses have stopped; as the curve is constructed it is not 


possible for it to fall. 
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Q D «ction of lever-pressing by rats following a single reinforcement 
ot 8; Extinction “ements. Although several times as many responses are 
emitted PEAT reinforcements as after a single reinforcement, it is evident that the 
total number of responses in the “reserve does not increase in direct proportion to 
the number of reinforcements. Data from Skinner (1933) and F. S. Keller and 
A. Kerr (unpublished data), replotted from Skinner (1988), pages 87 and 91. Both 
clive’ have been moved pack so that time is here shown as measured from the first 


response, 
able influence of a single reinforcement leads Skinner 
a 


were able to isolate a single operant, we might 
find an instantaneous change to a maximum probability.“* This con- 
jecture, we may note, is very similar to that of Guthrie with respect to 
one-trial learning. The further conjecture that actual conditioning is 


12 Skinner (1953a), page 68. 


The consider 
to conjecture that, if we 
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found to be gradual because several component operants are involved is 
also not unlike Guthrie. The residual difference is that the single event 
for Skinner is a reinforcement, whereas for Guthrie it is a mere occur- 
rence. 


The importance of single reinforcements (“one-trial learning”) is 
brought out in two arrangements which have come to prominence in 
Skinner's later work. The first of these is the development of “supersti- 
tious” behavior. The second is the control of behavior through successive 
approximations. 

If there is only an accidental connection between the response and the ap- 
pearance of a reinforcer, the behavior is called “superstitious.” We may demon- 
strate this in the pigeon by accumulating the effect of several accidental contin- 
Sencies. Suppose we give a pigeon a small amount of food every fifteen seconds 
regardless of what it is doing. When the food is first given, the pigeon will be 
behaving in some way—if only standing still—and conditioning will take place. 
It is then more probable that the same behavior will be in progress when the 
food is given again. If this proves to be the case, the “operant” will be further 
strengthened. If not, some other behavior will be strengthened. Eventually a 
given bit of behavior reaches a frequency at which it is often reinforced. It then 
becomes a permanent part of the repertoire of the bird, even though the food 
has been given by a clock which is unrelated to the bird’s behavior. Conspicuous 
responses which have been established in this way include turning sharply to 
one side, hopping from one foot to the other and back, bowing and scraping, 
turning around, strutting, and raising the head. The topography of the behavior 
may continue to drift with further reinforcements, since slight modifications in 
the form of response may coincide with the receipt of food. 

If, for example, three reinforcements were always required in order to 
change the probability of a response, superstitious behavior would be unlikely, 
It only because organisms have reached the point at which a single contingency 
makes a substantial change that they are vulnerable to coincidences,14 

Because the or 
the reinfo; 


and ingenuity in 
thod. Some of the reports have 


from a vending machine, another show- 


> a third in which a 
m a pedal-operated refuse can, The 


animal-training possibilities are almost unlimited.2> How do single rein- 
13 Skinner (1953a), page 85. 
14 Skinner (1953a), page 87. 
18 The accounts have appeared many places, such as: Life, M 
> : » May 31, 1937; Sci 
Newsletter, June 17, 1950; Newsweek, May 7, 195 s Beale goa 
(1951), Skinges (1951), y 1. See also Breland and Breland 


SKINNER’S OPERANT CONDITIONING 89 
forcements bring such diverse behavior under control? Here is one demon- 
stration: 


An effective classroom demonstration of the Law of Effect may be arranged 
in the following way. A pigeon, reduced to 80 per cent of its ad lib weight, is 
habituated to a small, semi-circular amphitheatre and is fed there for several days 
from a food hopper, which the experimenter presents by closing a hand switch. 
The demonstration consists of establishing a selected response by suitable rein- 
forcement with food. For example, by sighting across the amphitheatre at a scale 


on the opposite wall, it is possible to present the hopper whenever the top of 
iven mark. Higher and higher marks are chosen 


the pigeon’s head rises above a gi L y a 
until, within a few minutes, the pigeon is walking about the cage with its head 
held as high as possible. In another demonstration the bird is conditioned to 


strike a marble placed on the floor of the amphitheatre. This may be done in a 
few minutes by reinforcing successive steps. Food is presented first when the 
bird is merely moving near ‘the marble, later when it looks down in the direction 


of the marble, later still when it moves its head toward the marble, and finally 
when it pecks it. Anyone who has seen such a demonstration knows that the Law 


of Effect is no theory. It simply specifies a procedure for altering the probability 


of a chosen response.*® 


‘Positive and Negative Primary Reinforcers 
y its effects. Any stimulus is a reinforcer if 


( A reinforcer is defined b 
\The stimuli that happen to act 


it increases the probability of a response 


all into two classes: ar 
rcer is a stimulus which, when added to a situa- 


obability of an operant response. Food, water, 


as reinforcers f 
“1. A positive reinfo 
tion, strengthens the pr 


sexual contact, classify as positive reinforcers. 


M4. A negative reinforcer is a stimulus which, when_removed from a 

situation, strengthens the probability of an operant response. A loud 

> ; i 

noise, a very bright light, extreme heat or cold, electric shock, classify as 
ie 


negative reinforcers. i , 
Notice that(the effect of reinforcement is always to increase the 


probability of response. unishment is something other than negative 
reinforcement as here de “ied. While a reinforcer is defined by its effects, 
this is not true for punishment. (Punishment, according to Skinner and 
his followers, is defined as an experimental arrangement whose effects 
remain to be investigated empirically. The arrangement is the opposite 
of reinforcement (although the effects are not opposite), so that two 

ntation of a negative reinforcer, and (2) 


main cases arise: (1) the prese! i 

the removal of a positive reinforcer. We shall return later to a considera- 
tion of the consequences of punishment. eee . 

Why is a reinforcer reinforcing? One possibility is that reinforcement 


16 Skinner (1950), page 200. 
17 Skinner (1953a), page 73. 
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reduces a state of deprivation, that is, it meets a need, as when a hungry 
animal eats. The hungrier a bird, the greater the rate of its responses as 
a consequence of food reinforcement. Skinner recognizes, however, that 
reinforcement does not always reduce deprivation. Conditioning may oc- 
cur before any substantial change can take place in the internal state 
of the learner. 


All we know is that the type of event which reduces deprivation is 
also reinforcing. 
Skinner rather tentatively accepts an explanation in terms of evolu- 


tionary biology, but he does not find it of much help in the detailed func- 
tional analysis of what actually occurs.28 


| Schedules of Reinforcement 


\ The reinforcement of operant behavior in ordinary life is not regular 
and uniform. \The fisherman does not hook a fish with every cast of the 
line; the crop the farmer sows does not always yield a harvest. Hence 

_the problem of maintaining or strengthening a response through inter- 
mittent reinforcement is more than a laboratory curiosity.° Skinner has 
explored extensively two main classes of intermittent reinforcement, now 
called interval reinforcement and ratio rej orcement. 

C By Pao Pedic a he crf ce aia given at fixed 
intervals of time, every 8 minutes or every 10 minutes, at the di 
of the experimenter. In the experiment itself this me 
ment of the first response that occurs after that interva 
is actually some slight variation in the time of reinforcement depending 
upon the activity of the learner. This arrangement, also called periodic 
reconditioning, or periodic reinforcement, delivers a standard amount of 
reinforcement per hour\It results in lawful rates of responding, the rate 
being proportional to the interval between reinforcements, the shorter 


tandard conditions 
und about 18 to 20 


ge of intergals. The 
gures Yand #0, 


scretion 
ans the reinforce- 
l of time, so there 


to another, provided drive remains TARR 
18 Skinner (1958a), pages 81-84. 


19 T} 


b "e expression “intermittent reinforcement” is a better descriptive term than 
partial reinforcement,” an earlier name for the same arran ement (Hilg: 
Marquis, 1940, page 347). 5 paige age 
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Although curves drawn in the scale of Figures 9 and 10 appear uni- 
form, there is actually a stepwise character to response when reinforce- 
ment is given at regular intervals. This follows because a response is never 
reinforced shortly after a prior reinforcement, and so, after a time, the 
rate of responding is low following a reinforcement. This minor fluctua- 
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“thin one session of fixed-interval reinforcement. A pellet 
ciel de: nore beng et 12 minutes, respectively. The more frequent the 
reinforcement, the more rapid the rate of responding, ac py ach rate is relatively 
uniform, After Skinner (1938), as reproduced by Hilgard and Marquis (1940), page 
151. 

“thin repeated sessions of fixed-interval reinforcement. 
OSURE M cate a Pie seem are given in Figure gias here accumulated 
for P Saee daily sessions. The uniformity of a persists K aougtont M hen CEs 
pressed as SacBee of responses per reinforcement this na 5 d oa ed as the “extine- 
tion ratio ” After Skinner 1938), as reproduced by Hilgard and Marquis (1940), 


page 151. 
es can be corrected by a modification known 


ti i £ the curv ‘ 
Orion aOR ment (also called aperiodic reinforcement. ) 


as variable-interval reinforce À 
Ua ariabie pee an average interval is substituted for a fixed 
r s % 


interval, so that, while a response MAY be reinforced every 5 minutes on 

the ae e, in spine cases the second reinforcement follows immediately 
ra; A A 

upon an erlie reinforcement, and at other times it is longer delayed, 
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Under such a schedule, performance is remarkably stable and uniform, 
and highly resistant to extinction. An illustration is found in Figure 11. 


A pigeon has given as many as 10,000 unreinforced responses in extinction 
following such variable-interval reinforcement. 


RESPONSES 


kK ONE HOUR —— 


FIGURE 11. Responses within variable-interval reinforcement. The curves 
the pecking responses of an individual pigeon reinforced at interv: 
10 seconds to 21 minutes, but averaging 5 minutes. Each of the sl 
sents 1,000 responses; the pen resets to zero after each 1,000. The 
resents some 20,000 responses in about 3 hours, with an av. 
per hour. Each reinforceme 


(1950), page 208. 


are of 
als ranging from 
loping lines repre- 
whole record rep- 
erage of 12 reinforcements 
nt is represented by a horizontal dash. From Skinner 


( The results under intermittent reinforcement show how input and 
output change under different schedules of reinforcement.) While the 
constancy of the extinction ratio under conditions of interval reinforce- 
ment suggests a standard input-output ratio, something very different 
happens under another arrangement of intermittent reinforcement, th 
known as ratio reinforcement. 
| Ain ratio reinforcement, instead of delivering a pellet of food at stand- 
ard intervals of time, a pellet is delivered after a standard number of 
responses. In one study they were delivered after 16, 24, 32, 48, 64, 96, 
or 192 responses. These ratios have to be approached gradually, for such 
occasional reinforcements can maintain responses only if responses are 
already being emitted at a rapid rate. The end-products of such training 
are shown in Figure 12. The somewhat paradoxical finding now is that 
the less frequent the reinforcement, the more rapid the response! The 


extinction ratio has changed from the order of 20:1 under interval rein- 
forcement to 200:1 under ratio reinforcement, 


The steplike character of performance, me 
reinforcement at fixed intervals, is pronounced with reinforcement at 
fixed ratios. After the burst of responding just before reinforcement there 
follows a period of very slow responding just after reinforcement. (If the 


at 


ntioned in connection with 
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ratio is made too high, there develop long periods of no response follow- 
ing reinforcement, a condition likened by Skinner to “abulia,” the in- 
ability to expend effort. He points out the analogy with the student who 
has finished a term paper, perhaps in a burst of speed as the deadline 
approaches, and then finds it difficult to start work on a new assignment.” 
The pause after reinforcement may be eliminated by adopting 
variable-ratio reinforcement, that is, using a range of ratios around a mean 
value. \Because the probability of reinforcement at any moment remains 
approximately constant, a uniform rate of responding ensues; because 
this probability is increased with rapid responding, the rate tends to be 
d as rapidly as five times per second and keep 


high. A pigeon may respon 
up this rate for many hours. (This rate corresponds to that of the ticking 


of a watch. ) 
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i ithin ratio reinforcement. Responses from individual rats 

M a Reap aa an 192 responses, as indicated by the horizontal lines, Un- 
on ee every a ary high rates of responding develop, the highest rate being 
ese circumsta f reinforcement. After Skinner (1988), as repro- 


found with the lowest frequency o 
duced by Hilgard and Marquis (1940), page 152. 


so be brought under multiple control. For example, 


by selecting one of three colors to be projected on the key that the 
pigeon pecks, the pigeon can be put on fixed-interval, fixed-ratio, or 
variable-interval reinforcement, and will behave as it has learned to be- 
have under each of these schedules alone. Skinner has trained a pigeon 
up to nine different performances under nine different conditions of stimu- 


leion d schedule—a multiple schedule without 


iation is a mixe 4 i 
Another vari: 1, That is, the pigeon has no differential external 


hether it will be reinforced after responding 


Schedules can al 


external stimulus contro 
stimulus to determine W 


20 Skinner (1953a), page 104. 
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30 times or after the passage of 15 minutes. These conditions are alter- 
nated more or less at random. The characteristic stable performance is 
thus described by Skinner: 


The bird runs off a number of responses at a high rate. If the current sched- 
ule is a ratio schedule, it will then be reinforced. If it is, however, an interval 
schedule, the bird will run somewhat past the number reinforced as a ratio and 
then will break suddenly to a zero rate of responding from which emerges a 
smooth interval scallop extending over the rest of the 15-minute period. That 
is, the bird “tests” the schedule by running off a number of responses sufficient to 
achieve reinforcement if the schedule is a ratio; if no reinforcement is forth- 
coming, the performance “primes” the bird into the interval scallop.” 

It is evident that the bird’s own behavior can function as a dis- 
criminative stimulus in controlling schedule performances. 

These more recent experiments, while belonging to the same gen- 
eral family of experiments as those reported in The behavior of organisms 
(1938), show a shift of emphasis. Earlier Skinner's attention was directed 
to the general course of a change in behavior over an experimental period 
or through a series of experimental sessions. His primary concern is now 
with the moment-to-moment behavior, or as he puts it, with the “fine 
grain” of the records rather than the over-all course of changes. With 
this change of emphasis the fundamental measure of response strength 
becomes the momentary probability of responding, rather than a more 
global “habit strength” or “reflex reserve.” In order to determine 
mentary probability, it is of course necessary to study frequency of re- 
sponse over more extended periods of time. Skinner’s emphasis upon 
frequency of response has persisted from his earliest writings, and he has 
recently ably defended his choice of this measure.? 


a mo- 


w” “Secondary Reinforcement 


The principle of secondary reinforcement is simply this: 


A stimulus that is n 


ot originally a reinforci ‘ye 
forcing through repeated Emay orcing one 


association with one that is,2* 
That is, through conditioning, a stimulus acquires the power to condition, 
Consider, for example, the acquisition of reinforcing power by the ie 
in the following experiment. When a rat presses a bar, a light comes on. 
After 1 second a pellet of food falls into the tray, reinforcing the bar- 
pressing. The light remains on for two seconds after the food appears. 
Several groups of animals are conditioned in this way, for 10, 20, 40, 80, 


- can become rein- 


21 Private communication date 
22 Skinner (1953b). 
23 Keller and Schoenfeld (1950), page 232. 


d December 28, 1954, 


\w 


- is thus either part of a chain or a 
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and 120 reinforcements, respectively. After conditioning, the rate of 
responding is reduced to a low level by extinction in the dark. Then 
pressing the bar again turns on the light for 1 second, but does not de- 


liver food, Under these circumstances responses again appear, showing 
that the light has acquired reinforcing properties. The number of re- 


sponses emitted in a 45-minute period increased with the number of 


prior pairings of light and food.** 
Other experiments suggest that the light will acquire secondary re- 


inforcing properties only if it appears before the reinforcing stimulus, and 
discriminative stimulus.” 


“(The following summary of secondary reinforcement shows its sys- 


tematic importance in operant behavior: 


(a. A stimulus that occasions Or accompanies a reinforcement acquires 
thereby reinforcing value of its own, and may be called a conditioned, secondary, 
or derived reinforcement.)A secondary reinforcement may be extinguished when 
repeatedly applied to a response for which there is no ultimate primary reinforce- 


ment. 
( 2. A secondary reinforcement is positive when the reinforcement with 


which it is correlated is positive, and negative when the latter is negative. 


3. Once established, a secondary reinforcement is independent and non- 
then the same response which produced the 


specific; >it will not only streng an 
original reinforcement, but it will also condition a new and unrelated response. 
Moreover, it will do so even in the presence of a different motive. 


L4. Through generalization, many stimuli besides the one correlated with 
ire reinforcing value—positive or negative.*° 


reinforcement acqui 
One of the most important consequences of the development of sec- 
ondary reinforcement js the emergence of a class of generalized rein- 
omes about because some secondary re-- 


forcers.?" This generalization © 
inforcers tend to accompany 4 variety of primary reinforcersMoney is 
a convenient illustration, because money provides access to food, drink, 


shelter, entertainment, and thus becomes a generalized reinforcer for a 
variety of activities. The so-called needs (need for attention, need for 
affection, need for 2E Jead to the kinds of persistent behavior best 
understood as a consequence of intermittent reinforcement, and the 
kinds of reinforcement sought are the generalized ones implied in the 
words “attention,” “affection, and “approval.” Language behavior, such 
as calling objects by their correct names, tends to be reinforced by the 
generalized reinforcement from the listeners, who show in indirect ways 
whether or not they understand and approve| According to Skinner, even- 


24 B i 

= Brehti on and Bersh (1950a); Webb and Nolan (1953). 
26 Keller and Schoenfeld (1950), page 260. 
27 Skinner (1953a), page 77-81. 
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tually generalized reinforcers are effective even though the primary 
reinforcers on which they are based no longer accompany them.’* This is 
close to the functional autonomy of motives, as proposed by Allport.”® 


“OTHER INFLUENCES AFFECTING OPERANT STRENGTH 


In the effort to remain descriptive and positivistic, Skinner has at- 
tempted to avoid the postulating of intermediaries not observed in his 
experiments, and to deal instead with a procedure that he calls functional 
analysis. A functional analysis is concerned with the lawfulness of re- 
lationships and the manner in which these relationships fluctuate under 
specified conditions. We have just reviewed one very important class of 
events in the functional analysis of behavior: reinforcements, as they 
affect operant strength. Two other classes of events are significantly cor- 


related with responses in the problems that Skinner studies. These are 
the classes known as drive and emotion. 


Drive 


Hours of food deprivation are important in determining the rate 
of responding in a Skinner box. The variable actually plotted is exactly 
that: hours of deprivation. ls it necessary to say anything further? Is it 
important to talk about physiological needs, or hunger? Let us first cite 
some typical experimental results and then return to these questions. 

Eight rats learned to press a lever under interval reinforcement. They 
practiced daily but received the main portion of their rations on alter- 
nate days. The correlation between responding and hours of depriva- 
tion showed up in high and low rates on successive da 
were created, matched according to their response rates during condition- 


ing. They were now extinguished on alternative days, one group when 
hunger was high, the other when hunger was low. The high hunger 
group yielded nearly double the responses of the low group on successive 
oe en although the two extinction curves showed similar curva- 
ture. 


ys. Two subgroups 


28 Skinner (1953a), page 81. 
29 Allport (19387). 
30 Skinner (1950), pages 201-202. 
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groups of animals received 1, 10, and 30 reinforcements, respectively. 
Then the strength of conditioning was tested by resistance to extinction, 
at a common level of deprivation (23 hours). Resistance to extinction 
was throughout correlated with the number of prior reinforcements, but 
not with the level of deprivation within reinforcement.” 

These two studies suffice to illustrate the kinds of relationships be- 
tween drive and operant conditioning ‘that are open to investigation. In 
a narrative account of the results we tend to move back and forth be- 
tween describing the rats as “deprived of food” or as “hungry.” The 
two expressions are operationally equivalent, though when we assign the 
results to “hunger” instead of to “hours without food” we tend to imply 
a theory. This leads us back to the problem of the status of “drive” as a 
concept. 

( Skinner is quite clear that he means by “drive” merely a set of op- 
erations (such as the withholding of food for a certain number of hours), 
which operations have an effect upon rate of responding. He is interested 
in the lawfulness of these effects under various circumstances. He objects 
to most of the current uses of “drive” by supporting the following asser- 


tions; * 


A drive is not a stimulus. | 

A drive is not a physiological state. 

A drive is not a psychic state. 

A drive is not simply a state of strength. 


By these negatives he makes it clear that he does not accept the 


stimuli from stomach contractions as the prototype of drives, nor does he 
accept physiological needs, or pleasures or pains, or desires or wishes. 
For the purposes of the systematic study of behavior, the word drive” is 
used only to acknowledge certain classes of operation which affect be- 
havior in ways other than the ways by which reinforcement affects it) He 
does not believe inference to an intermediary (intervening variable or 
hypothetical construct) to be necessary in order to carry out the func- 


tional analysis. 
Emotion and Punishment 
ropriately classified as stimuli, so, according 


n unwisely classified as responses. Weeping 
loss of a game is ordinarily said to be an 


Just as drives are inapp’ 
to Skinner, emotions are ofte: 


at a bruised shin or over the ? i l 
emotional response, but weeping because of a cinder in the eye is not. 
5 


81 Strassburger (1950). 
82 Skinner (1953a), pages 144-146. 
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This way of treating emotion is rejected in favor of referring it to a set 
of operations, in many ways like drive. Its importance arises because of 
the accompanying or ensuing changes in response. 

The effects of punishment are often emotional by this definition. 
Punishment might act dynamically as negative conditioning, as though 
reinforcement were being subtracted, but in fact it does not serve in this 
way. If at the beginning of extinction the rat is slapped on its feet when 
it presses the lever, its rate of responding is depressed, but it eventually 
recovers. The total responses to extinction have not been reduced by 
the punishment, as they should be if the punishment were negative con- 
ditioning.** Such a temporary effect upon rate but not upon total re- 
sistance to extinction is emotional. There is here some support for Thorn- 
dike’s belief that punishments do not act opposite to rewards. \ 

Because of the importance of the problem of ‘punishment, Estes’ 
(1944) study, carrying further Skinner’s earlier explorations, will be re- 
viewed later as an exemplification of Skinner’s system, 


{ DISCRIMINATION AND DIFFERENTIATION ' 


( In situations which require more than a change in rate of respond- 
ing, two features stand out. One is the discrimination between stimuli, so 
that a given response may be made to one of a pair of stimuli and not 
to the other member of the pair. A second feature is differentiation of 
response, so that response form or topography is alter 
appropriately to the situation./The complexities of beh 
derstood according to discriminations and differentiati 
appropriate chains or patterns. 


ed or adjusted 
avior can be un- 
ons arranged into 


Discrimination of Stimuli 


` The standard lever-pressing experiment m 
discriminatory conditioning if the lever-pressing delivers a pellet of food 
in the presence of a positive, supporting stimulus (eg. a 8 cp. light) 
and fails to deliver in the absence of this discriminatory stimulus. The 
rat learns to respond only when the light is on, but the light does not 
actually elicit the response. The difference between a discriminatory 
stimulus as an occasion for a response and as eliciting it is clarified by 
an example. I reach for a pencil lying on the desk, but I reach only when 
the pencil is there, and I do not reach for it just because it is there, While 


the pencil does not elicit reaching, it has something to do with my reach- 
38 Skinner (1938), page 154, f 


ay serve the purposes of 
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ing. If it were dark and there were no pencil there, I might grope for it 
because the discriminatory stimuli would be lacking. The pencil does 
not elicit reaching in the light any more than in the dark. It is only the 


occasion for reaching.** 


When the food is delivered only in the presence of the light, the 
situation is a sort of controlled intermittent reinforcement, for lever- 
pressing is reinforced only part of the time. Eventually response occurs 
almost exclusively when the light is on, so that nearly every response is 


reinforced. The resistance to extinction that is built up at this stage is 


not that of interval reinforcement, but that of ordinary every-trial rein- 


forcement. There are two operants with the same form of response, one 
light off; they are selectively reinforced 


with the light on, one with the 
and extinguished. There is an interaction called induction by Skinner 


which corresponds to Hull's generalization; whatever happens to one 


operant affects the other to some extent. 

(The following statements are offered to clarify this somewhat com- 
plex situation: ** 

1. Responses accumu 
are available in the presence © 


ple of inductive conditioning Or generalization. 
2, Responses emitted in the presence of the negative stimulus are no 


longer available under the positive stimulus. This is the principle of in- 


ductive extinction. 
8. Selective reinfo 


Jated in the presence of the positive stimulus 
f the negative stimulus. This is the princi- 


cement and extinction increase the number of 

responses available chiefly in the presence of the positive stimulus. This 

is the positive half of the principle of discrimination. 

4, Responses acquired in the presence of the positive stimulus may 
available under the negative stimulus. This is the 


become less readily Nene 
negative half of the principle of discrimination—the breakdown of in- 


duction. : i 
The discrimination experiment 1s always complicated by an addi- 


tional fact of importance. The positive stimulus which acts as the occa- 
sion for the response and for the reinforcement becomes itself a secondary 


reinforcing agent. 

Skinner believes th 
is superior to that usually use 
the animal. His rats may eithe’ 
do not have to choose between 


at his arrangement for obtaining discrimination 
d, in which a choice of responses confronts 
T respond to the lever or not respond. They 
two levers, or between a right and a 


84 Skinner (1938), page 178. 
35 Skinner (1938), page 229. 
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left turn. He believes that the discrimination box is a “crude instrument” 
for studying the nature of the process,®* and finds Tolman’s (1938a) de- 
pendence upon choice-point behavior a severe limitation.*? The objection 


is that in the choice situation no measure of strength is obtained—only a 
measure of relative strength. 


Differentiation of a Response 


Among a number of novel and useful distinctions made by Skinner 
is that between discrimination of stimuli and differentiation of response. 
In operant conditioning, reinforcement can be made contingent on either 
(a) the properties of accompanying stimuli (when the result is a dis- 
crimination), or (b) the properties of the response (when the result is 
a differentiation). 

The rat may be taught, for example, to press the lever with a given 
force, or to hold it down for a given duration, in order for the pellet to 
be delivered. The basic rule of operant conditioning applies: that the 
response must occur before it can be reinforced. Extreme forms or values 
may be obtained by successive approximations, Operant responses are 
emitted with an original range of form or intensity (Hull’s response 
oscillation). If only the more extreme values are reinforced, the whole 
distribution shifts, so that higher and higher values may be obtained. 
The training method of successive approximations permits the finally 
learned behavior to be very different from that originally emitted, 


r tour de force 
orate series of responses suggested by 


onsists of pulling a string to 


r of the cage, and d; ing it 
inside. Every step in the process had to be workei 5 ropping i 
of the rat.28 


Response novelty is one of the features ra 


ther badly accounted for 
in most stimulus-response systems, 


Thorndike’s law of assimilation, 
Guthrie’s principle of compromise movements, and Skinner’ 


approximations are all attempts to deal with the 
of emitted behavior has advantages over that of e 
always been embarrassing for theories of the c 
to find original stimuli to produce the responses 


S successive 
problem. The concept 
licited behavior, It has 
onditioning type to try 
called for in singing a 


38 Skinner (1938), page 231. 
87 Skinner (1938), page 437, 
88 Skinner (1938), Pages 339-340. 
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song or in writing a poem. Such stimuli need not be specified for emitted 


behavior. 


THE SEARCH FOR A UNIT OF BEHAVIOR 


In the foregoing account of some of the kinds of data which Skinner 
collects and of the concepts used in describing them, his more general 
theory has been implied rather than made explicit. Skinner is definitely 
interested in supporting a scientific theory of a specific kind. He favors 
what he calls a purely descriptive system, and a frankly analytical one. 
How he goes about system-making is best illustrated in his discussion 


of the reflex and its laws. 


Early Reliance on the Reflex as the Unit 

Skinner believes that a purely descriptive system, in order to be 
scientific and not a mere “potanizing” of behavior, must choose an ap- 
propriate natural unit of behavior. A real unit will not be an artificial bit 
of behavior taken improperly out of context, nor will it be something too 
complex to enter into orderly relations as a descriptive unit. The level of 
specification found to be necessary is not something a priori, but is the 
level found in experience “marked by the orderliness of dynamic 
changes.” * Skinner believed, especially at the outset, that his system 
had a merit not possessed by other stimulus-response systems—the dis- 
covery of this appropriate unit: the reflex, as he defined this term. 

By a reflex Skinner means a lawful correlation between a class of 
stimuli and a class of responses: The reflex is not to be identified with 
the spinal reflex, which is defined topographically; its reference is solely 
to behavior and not to anatomy OF neurology. The definition rests on a 
dynamic relationship. What a stimulus is, what a response is, what a 
reflex is can be defined only according to the lawful experimental con- 


sequences when the reflex unit is isolated. 


n analytical unit is actually obtained in practice. The unit is 


ajio z idity and the validity of the laws describing its changes do not 
ae val on ete SP of analytical assumptions or the possibility of a 


later synthesis of more complex behavior.*° 
The laws of the reflex. There are static laws, dynamic laws, and laws 


of interaction.** Beam : 
The static laws refer to the quantitative properties of representa- 
39 Skinner (1938), page 40. 


40 Skinner (1938), page 29. 
41 Skinner (1938), pages 1 


The reflex as a 
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tive reflexes. They are reminiscent of the laws of spinal reflexes: the law 
of threshold, the law of latency, the law of magnitude of response as a 
function of intensity, the law of after-discharge, and the law of temporal 
summation. 

The dynamic laws include “classical examples,” which means, pre- 
sumably, that they too derive from the study of spinal reflexes: the law 
of refractory phase, the law of reflex fatigue, the law of facilitation, the 
law of inhibition. In addition there are other examples, logically similar 
but differing in the nature of the operations affecting reflex strength: 
the laws of conditioning, extinction, drive, and emotion. These h 
already been considered. 

The laws of interaction give an answer to the part-whole problem. 
They are: the law of compatibility, the law of prepotency, the law of 
algebraic summation, the law of blending, the law of spatial summa- 
tion, the law of chaining, and the law of induction. 

There is a family resemblance between the topics covered in Skinner’s 
laws and in Hull’s postulates, although as they are given by Skinner the 
laws are defining principles and are not stated so as to be used to deduce 
behavior. Skinner’s laws are unlikely to be found false because they are 
worded in such a form that they are true at the level of general observa- 
tion. They provide a framework for further specification and quantifica- 
tion. They enter exposition quite differently from Hull’s postulates, Ac- 
tually, there is little reference to the laws as the data from experiments 
are discussed. 

How laws of this kind define a descriptive system and lead to ex- 
periments might be illustrated by a parallel example from plant growth, 
Preliminary observation would soon establish the fact that growth varies 
with the amount of water in the soil, the range of temperature, and the 
amount of sunlight. Before proceeding to further experimental work it 
would be possible to state a series of growth laws: one law relating 
growth to moisture, another to temperature, another to illumination, 
together with some presumptive laws of their interaction, F 
it is plausible that with more moistur 
at somewhat higher room tem 


t according to 
any regard for the physiology of cells 
or for the chemistry of growth horm 


ones. The observations would, in fact, 
be governed largely by the descriptive system implied in the laws, but 


ave : 
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their aim is not to prove or disprove these laws, for as far as it goes each 
law merely describes experimental findings. This way of investigation is 
essentially what Skinner has proposed and what he has succeeded in 
doing. There is this difference, that the isolation of a reflex to be studied 
and the selection of units of measurement is a more perplexing problem 


than defining plant growth. 


The Functional Unit Studied Is Not Necessarily the 

“Atom” of Behavior 

While Skinner continues to search for the appropriate unit of analy- 
sis, the basic datum, he appears no longer to be confident that he has 
found the unit either in the reflex or in the measured operant. Three 
terms enter into the characterization of a conditioned operant: the dis- 
criminative stimulus that is the occasion for the response, the response, 
and the reinforcement. The “same” response is not the “same” operant 
if correlated with different stimuli or with different reinforcers. The oper- 


ant under study is perhaps not a “single” operant. 


A somewhat modified suggestion as to an “element” of behavior 


emerges in the following passage which follows a discussion of the prob- 


lem of transfer: 

This leads us to identify the clement rather than the response as the unit of 
behavior. It is a sort of behavioral atom, which may never appear by itself upon 
any single occasion but is the essential ingredient or component of all instances. 
The reinforcement of a response increases the probability of all responses con- 


taining the same elements.** 

We lack adequate tools to deal with the continuity of behavior or with the 
interaction among operants attributable to common atomic units. The operant 
represents a valid level of analysis, however, because the properties which deine 
a response are observable data. A given set of properties may be given a functional 

ntually be developed which will not emphasize 


unity. Although methods must ever ? ) 
units at this level, they are not necessary to our understanding of the principal 


dynamic properties of behavior.** 
These quotations suggest that Skinner believes we are still at a very 


preliminary stage in the development of behavioral science; Perhaps his 
objection to theory (as theory is used by others) arises because of his 
strong conviction that we need to discard old concepts and make a fresh 
start. We are not far enough along this new road to begin the more de- 


tailed analysis called for in matured theory. 


42 Ski 1953a), page 94- 7 3 
43 Sines tapaasi, P 95. Emphasis added. 
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j 


A Chained Responses as Illustrative of Functional Units 


What is the practical meaning of assigning “functional unity” to a 
given set of properties? Some light can be thrown on this by an analysis 
of chaining, for actual behavior as we know it comes in sequences larger 
than those of bar-pressing or spot-pecking, 

Even the bar-pressing operant is in reality a chain: 


The use of a chain cannot be avoided in operant beh 


avior because the very 
act of reinforcement implies it.*# 


In their discussion of chaining, Keller and Schoenfeld give a lucid 
account of the problem of determining the degree of analysis desirable 
in order to remain at the level of experimental specification.** They be- 


gin with six distinct operants describing what the rat actuall 


ly does in 
a Skinner box: 


Mole a a Response of the Rat 
Bar-location Approach of rat to front of box 
Visual bar Rising on hind legs; placing paws on bar 
Tactual bar Pressing of bar, thus activating food-magazine 
Apparatus noise Lowering foreparts to food-tray 
Visual pellet Seizing of pellet by teeth and paws 
ss sian, ation ae Pellet-in-mouth Chewing of pellet 


It would be possible, as they suggest, to lengthen this list, includ- 
ing the several responses making up the approach from the door to the 
bar-location, and the several ingestion responses that follow upon the 


; ver, they find it equally legitimate to re- 
duce the list, as follows: 


Operant Discriminative Response of 
Number Stimulus the Rat 
T saaa sev: axbny Visual bar Rising 
D W a Tactual bar Pressing 
Gi Ae RARER ean Apparatus noise Lowering 
E AME ET Visual pellet Seizing 


The advantage of this reduction lies in the 


clearly observable and 
regularly recurring sequence or chain of responses 


» with identifiable (and 


44 Skinner (1938), page 43. 
45 Keller and Schoenfeld (1950), pages 197-208. 
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controllable) stimuli. In this chain each response produces the dis- 
criminative stimulus for the next response. 

Does the chain operate as a unit? The well-conditioned rat makes 
the transitions so smoothly that it seems to be giving one response, not 
four. But the independence of the units of the chain can be tested ex- 
perimentally. 

1. If we eliminate only the stimulus for the final unit of the chain 
(the pellet), as in extinction, the first three links of the chain are gradu- 
ally weakened, but the last is unaffected. The rat will seize and eat a 
pellet exactly as before. 

9, If we now eliminate both the third and fourth links in the chain 
(the apparatus noise as well as the pellet), and carry out extinction, we 
can find out more about the chain. Reintroducing the noise after extinc- 
tion is well along again reinforces bar-pressing. Hence, during condi- 
tioning, the third link had become a secondary reinforcer. Furthermore, 
eding two links in the chain did not extinguish 


the extinction of the prec 
ird link. 


the reinforcing properties of the th 
This kind of functional and experimental study isolates units of the 


chain which preserve some independence in the whole. These units are 
part of a chain, and their distinctiveness as units is not entirely arbi- 
trary. It would be possible to record the separate responses, and not only 
the final one. It is suggested not only that future investigations will un- 
doubtedly employ such procedures, but that exploratory attempts have 
already been made.** , , 

The unit appropriate for experimental study turns out, in fact, to 
have a measure of arbitrariness about it. If it is a matter of convenience 
whether to measure one response of the chain, or four responses, or many 
more than that, there is evidently some selectivity exercised by the experi- 
menter. This is always true: all description is partial description. The 
nature of events is such that description of them cannot be exhaustive. 
The functional unit, for Skinner, as for other expemmentérs, is not given 
completely by the processes under investigation. Sometimes the functional 
unit is a simple response, sometimes 2 complex act, sometimes a rate of 
responding. The unit no longer has the clean dimensions of a correlation 
between a.class of stimuli and a class of responses as implied in thie origi- 
nal concept of a reflex. The atom of behavior proves to be evasive. 


46 Keller and Schoenfeld (1950), page 203. The attempts cited are those of Arnold 


(1947). 
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EXTENSION AND APPLICATION OF OPERANT 
CONDITIONING 


The generality of Skinner’s approach to problems of behavior is 
suggested by the titles chosen for his books: The behavior of organisms 
(1938) and Science and human behavior (1953). Neither title betrays 
that the precise data derive largely from experiments on rats and pigeons. 
Although the data are now being supplemented by experiments with 
human subjects, both children and adults, normal and disturbed, in these 
books the extension of the theory is by analogy rather than by experimen- 
tation. From a scientific point of view (using the word “science” as 
Skinner uses it) the extensions merely explore the kinds of variables 
which may eventually be brought under scientific control, The ultimate 
possibilities, as Skinner envisages them, are boundless: one chapter he 
includes is entitled “Designing a culture.” ‘4 We are not here concerned 
with the larger political and ethical aspects of the society he presents, 
Instead, our problem is to see the steps by which laboratory learning is 
extended to encompass learning in a larger context. For our purposes 
we may therefore select some typical problem areas and see how they 


are handled. Three such problem areas are schoolroom learning, verbal 
behavior, and psychotherapy. 


. L$ choolroom Learning 
Skinner has announced, and embarked upon, a program of investiga- 
teaching of arithmetic, read- 


47 Skinner’s novel Walden Two (1948) is a sketch of ; 2 
the possibility of a scientific culture. amoni Utopia portraying 
48 Skinner (1954), 
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in determining the proper order of presentation. We can learn about 
schedules of reinforcement from precise experiments and then build them 
into our machines. Because some schedules yield far more in the way of 
stability of response than other schedules, we can hope to save a great 
deal of time in the teaching of our children. 

While as described here, the new technique can serve only to teach 
a “skill,” it must be acknowledged that a great deal of time and effort 
teaching, and that failures entail great social cost. If 


goes into such 
they deserve a try, though their limita- 


the new methods prove promising, 
tions must of course be understood. Schoolroom learning is a social ex- 


perience as well as an opportunity to learn the multiplication tables. No 


new machine will make the teacher superfluous! 


Verbal Behavior 
ost clearly distinguishes human behavior from that of 


Knowledge of how we acquire language, and how we 
use it, is essential to an understanding of human learning. Skinner has 
long been interested in verbal behavior. As early as 1936 he produced a 
phonograph record composed of chance groupings af speech sound, 
which because they were chance were inherently meaningless. The rec- 
ord, called a “verbal summator,” was used to study the words “read into” 
sounds by the listener. It was a kind of projective technique, similar in 
the auditory field to the ink-blots used in the visual field.*° Within the 


next few years Skinner repor 
and other kinds of sound patte 


Language m 
other mammals. 


ted studies of word association, alliteration, 
rning.°° His William James Lectures, de- 


livered at Harvard University in 1948, are to appear eventually in revised 
form as a volume entitled Verbal behavior. We can see that verbal be- 


havior has been taken seriously as an empirical problem. 
The main point of the analysis is that speech sounds are emitted 


(and reinforced) as any other bits of behavior.*t Some speech utter- 
‘on the hearer and get reinforced as the hearer 


ances make demands up E s 
complies. This function (called the “mand” function) appears early 
in the language behavior of the child. A second function is concerned 


largely with naming (the “tact” function). The tact function leads to the 
ee and versatility of language. Its reinforcement by the hearer is 


E A -ow and Rosenzweig (1940). 
Sma (Ot ran Skinner (1939), Skinner (1989) (1941). 
51 figer | ilable treatment at the time of writing is the section on verbal be- 
The best available trea Jd (1950), pages 376-399, on ‘ny the following brief 
i ations are to be fi in Ski 
digest is based. A few su d illustrations are to be found in Skinner (19538a), 


pages 210-216. 
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more general than mand reinforcement. The reciprocal relationship be- 
tween the speaker and the hearer, involving mutual reinforcement, is com- 
plex, but subject to straightforward analysis. The analysis calls for no 
new principles, however, beyond those familiar in operant conditioning. 


Psychotherapy °* 


The need for psychotherapy is said to result from the by-products 
of excessive control by other people (especially through punishment), 
whereby the individual is either incapacitated or rendered d 
to himself or to others. Excessive control has by-products that 
emotional (fear, anxiety, anger or rage, depression ) 
ant behavior (drug addiction, excessively vigorous or 
defective discrimination of stimuli, 
self-stimulation ). 

The patient turns to the therapist because any relief (or promise of 
relief) is positively reinforcing. Psychoanalytic therapy can be character- 
ized most simply, according to Skinner, as follows: the ther. 
tutes himself a non-punishing audience. Under these circu 
sponses repressed by punishment tend to return. Forgotten experiences 
may be recovered, the patient may act aggressively (or at least verbalize 
aggressive impulses), the patient may exhibit strong emotion. The appear- 
ance of previously punished behavior in the presence of the non 
therapist makes possible the extinction of some of the effects of punish- 
ment. This Skinner believes to be the principal result of such therapy, 

This plausible account calls for no “explanatory fictions’—no id, ego, 
or superego, inhabiting a psychic or mental world. 

What is “wrong” with the individual who dis 
punishment is easily stated. A p 
ism whose behavior is disadv: 
vantageous or dangerous must be specified i 
quences both to the individual himself 


to supplement a personal history in such a way that behavior no longer has these 
characteristics.5? 


angerous 
are either 
or revealed in oper- 
restrained behavior, 
defective self-knowledge, aversive 


‘apist consti- 
mstances, re- 


-punishing 


Psychotherapy rests on the direct investigation and redirection of 
behavior itself. Behavior is itself the subject matter of therapy; the be- 


havior is not the symptom of some other subject matter, 
These conjectures are based on anal 


and experimental studies largely on lo 
ever, the methods have been extended 


ogies between human behavior 
wer forms. More recently, how- 


to the study of human patients 
5? Skinner (1953a), pages 359-383, 
58 Skinner (1953a), page 372. 
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hospitalized with psychoses. Preliminary results show that vending ma- 
chines with candy or cigarettes or pictures as reinforcing stimuli can 
sustain operant behavior over long periods of experimentation lasting 
an hour a day. It is supposed that results will be even more satisfactory 
with food reinforcement, and this is being tried. The therapeutic impli- 
cations thus far have not been of primary concern, but periods of psy- 
chotic activity within the experimental hour appear reduced in frequency 
and duration, at least for some patients.” 

The foregoing discussion of teaching, verbal behavior, and psycho- 
therapy suffices to show the flavor of Skinner’s theory as applied to a 
broader range of problems. The steps from the present knowledge of 
behavior to these broader fields are largely matters of technique and 
ather than of the discovery of new scientific principles. 


engineering, T 
of dynamic principles is expected to emerge as 


Nothing new in the way 
the broader subject matter is investigated. 


THE CONSEQUENCES OF PUNISHMENT AS 
ILLUSTRATIVE OF EXPERIMENTS WITH OPERANT 
BEHAVIOR 


Some of the suggestions about the action of punishment which Skinner 
had earlier made were more extensively studied by Estes (1944) under his 
direction. Estes’ work serves as a convenient example of the experimental 
approach to a problem by an investigator adopting Skinner's general point 
of view. Estes chose a more conventional terminology than Skinner, avoid- 
ing words like “operant” or “reflex” or “emitted,” but the experiments could 
have been described in Skinner’s words. It is of some interest that Estes 
found the more usual vocabulary satisfactorily precise for his purposes. 

On the assumption that punishment does have effects, the two chief 
possibilities are that it weakens the habit or that it merely suppresses 
response. If the habit is weakened, punishment acts as a negative rein- 
forcement, or as an agent hastening extinction. If the response is merely 
suppressed, the response has not been eliminated from the organism's 
repertory. Skinner's preliminary experiments favored the latter alternative, 
that punishment suppresses the rate of responding without eliminating 
responses from the total of responses to appear in extinction. Estes points 
out that the alternatives are genuine ones in the light of clinical experience, 
It is familiar that behavior may be repressed, and not overtly expressed, 
although the tendencies continue to exist at considerable strength. 


54 Skinner, Solomon, and others ( 1954) 


as 
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Punishment Does Not Act as a Negative Reinforcement 


In agreement with Skinner (and the later Thorndike), Estes finds 
that punishment does not lead to a reduction in the total number of 
responses given during extinction, even though there is temporary sup- 
pression of response following the punishment. Some characteristic find- 
ings are shown in Figure 13. 


300 


C] Not Punished 


HB Punished in First 
Extinction Period 


200 


RESPONSES 


100 


BEFORE 1 2 3 
BSTINGTION EXTINCTION PERIODS 

FIGURE 13. Evidence that punishment depresses rate of responding without 
decreasing the total responses within extinction, Responses of two 
each are compared. While the groups were not alike in their res e 
to extinction, their rates have been equated, and other comparisons 
this adjustment. Within the first extinction period one group of rats rec 
average of 9 punishments (electric shocks while pressing the lever), This punish- 
ment reduced the number of responses on the first day, and perhaps o 
but by the third day there was a compensatory increase in the numbe 
so that the total’ responses of the group punished on the first day 
not punished are the same over the three days of extinction, a 
Experiment A, corrected for differences prior to extinction. 


On the basis of additional experiments in the series, Estes concludes 
that the total number of elicitations of the response necessary for extinc- 
tion may be reduced somewhat by punishment, but the time required for 
complete extinction of the response will not be affected by the punishment, 
More generally, the conclusion is reached that a response cannot be 
eliminated from an organism’s repertory by the action of punishment alone, 


SKINNER’S OPERANT CONDITIONING T: 


Punishment More Importantly Associated with Stimuli 

Than with Responses 

If punishment were a reinforcing agent similar to reward, it would 
act on responses, in accordance with Skinner’s principle of Type R con- 
ditioning. That its more significant correlation is with discriminative stimuli 
is pointed out by Estes on the basis of several kinds of evidence. 

In one experiment * the animals of the experimental group were 
shocked at intervals of approximately 30 seconds, care being taken not 
to shock them during or immediately after a response to the lever. Re- 
covery from the effects of punishment follows the same course (i.e., the 


same mathematical function fits the course ) as the course following punish- 
Jf. It is argued therefore that the effect must be 


ment of the response itse 
bing stimuli with stimuli in the box which 


due to the contiguity of the distur 


normally act as occasions for lever pressing. 
This conjecture was tested in another experiment * in which the rat 


was left in the box for an adaptation period following punishment. During 
this time the lever was withdrawn, so that response to it was impossible. 
The effects of the short period of severe punishment were almost com- 
pletely dispelled, confirming the interpretation that punishment was re- 
lated to the stimulating situation rather than specifically to the response. 

Intermittent Punishment More Effective than Punishment 


at Every Occurrence 
delivered every time the response is made, (the rate 


of responding is seriously depressed. While it is not depressed as much 
if punishment is given only_oc ionally, the effects of punishment persist 
longer in the latter case.** This is shown in two experiments, in one of 
which the test is made by simple extinction, the other in which the ex- 
tinction follows an adaptation period. The results are most striking fol- 
lowing adaptation. Adaptation does not bring recovery after intermittent 
punishment as it does after every-trial punishment. The results are shown 
in Figure 14. , 

Estes offers some conjectures in explanation of these results. He ac- 
cepts a twofold theory of the effect of punishment. One principle, already 
referred to, is that punishment creates an emotional state which depresses 


If punishment is 


55 Estes (1944), Expat m 

56 Eş periment J. $ 

87 oie eae TTeepertments E and K. The average number of punishments re- 
ceived by each eae turns out to be about the same for both groups. Hence it is 
legitimate to compare the long-time effects of the two types of punishment. 
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responses while the state is aroused. This consequence of punishment does 
not lower the resistance to extinction. It is recovered from by adaptation 
or during subsequent extinction trials. The second principle is one con- 
gruent with Hull’s interpretation of punishment, that is, an interpretation 
of the cessation of shock as reinforcing. Estes believes that the withdrawal 
responses are positively conditioned by the termination of the shock de- 


CI Not Punished 
100 HB Punished Periodically 
Punished Regularly 


50 


RESPONSES 


EXTINCTION EXTINCTION 
(PUNISHMENT DAY) (AFTER ADAPTATION) 


FIGURE 14. Evidence that intermittent punishment is more effective than regular 
punishment. The mean numbers of responses of three groups are shown, a control 
group extinguished in the usual manner, a group = for responding whenever 
response occurred, and a group punished periodically, for Tesponses occurring within 
every fourth minute of the 40-minute extinction period. Following the experimental 
day there were two adaptation days, when the animals were placed in the boxes 
with the levers retracted. The results for extinction are for the first day with the 
exposed lever, after these adaptation days. The adaptation period has almost com- 
~~ eradicated the results of regular punishment, but the consequences of periodic 


punishment persist. Each group consisted of 6 rats, Data from Estes (1944), Ex- 
periment K, corrected for differences prior to extinction. 


livered by the lever. He believes, further, that such withdrawal responses 


are conditioned more strongly through intermittent reinforcement than 


through ordinary continuous reinforcement, consistent with Skinner’s find- 
ings for conditioning in general. 


Punishment in the Practical Control of Behavior 


Estes suggests several practical implications of his study. 
The main finding is that a response cannot be eliminated from the 


organism's repertory more rapidly with the aid of punishment than with- 
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out it. Permanent weakening comes about only by non-reinforced elicita- 
tion—and this weakening process may be prevented if punishment sup- 
presses the response. This is in line with clinical findings about forms of 
aggressive or hostile responses which have come under parental or other 
social punishment. They are not eliminated until they can be brought to 
free expression, when the behavior can be appropriately redirected. 
Another disadvantage of punishment lies in the concomitant emotional 
state which is aroused. This state will suppress other responses in addition 


to the one punished. The lack of specificity of its target is a weakness of 


punishment. 

On the positive side, t 
when and how punishment may be useful: 

1. Punishment may be used to hold a response at low strength. Under 


these circumstances, punishment has to continue indefinitely, as it does 
not eliminate the response. The continuing punishment is equally or more 
effective if administered occasionally than if it is given every time the 
objectionable behavior occurs. 


2. It is possible to take advantage of the period of suppressed response 
der to strengthen some other response by rein- 


here are several “rules” which serve to show 


following punishment in or 


forcement. f ; 
8. It is important that punishment be given in the presence of the 


discriminative cues for the response.(Delayed punishment is likely to prove 
ineffective)because it is given at a time when the discriminative stimuli 
a 


leading to the undesired conduct are absent. E l 

The series of experiments by Estes, with their interpretations, show 
how, within the experimental and theoretical framework of Skinner's 
system, it is possible to experiment upon problems genuinely relevant to 
the practical control of learning situations. The challenging difficulty 
remains of finding appropriate ways to test the implications in a social 


context. P 


F OF SKINNER’S SYSTEM 


Skinner’s Position on Typical Problems of Learning 


Now that we have available general textbooks written from the stand- 
point of Skinner’s position, it is possible t make a summary in relation to 
problems that do not fit directly into his conceptual scheme. 

1. Capacity. In a descriptive system, it is to be expected that the 
laws will contain empirical constants differing for various species and 
for different members of each species. The eating rate, for example, can- 
not be expected to remain the same for young and old animals, and for 
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animals unlike in their food preferences. Because lawfulness rather than 
laws is what the systematist insists upon, differences in capacity are not 
of central importance. There is no suggestion that at higher capacity levels 
the laws are essentially any different; verbal behavior in man, for example, 
is said to conform to the general principles of operant behavior. 

Skinner argues against the usefulness of a “trait” description in study- 
ing individual differences. A trait-name does not refer to any unit of be- 
havior suitable for study through the functional analysis that he recom- 
mends.°** 

2. Practice. Something like a simple law of exercise (practice under 
conditions of contiguity of stimulus and response) is accepted for Type S 
conditioning. The conditioning that occurs under Type R depends upon 
repeated reinforcement. The possibility is favored that maximum rein- 
forcement may occur in a single trial for the single operant, but the single 
operant is not achieved experimentally. In the usual case, the accumulation 
of strength with repeated reinforcement depends upon a population of 
discriminated stimuli and the related operants, more or less after the manner 
of Guthrie’s explanation of the interpretation of the acquisition of skill 
with practice. 

The effect of a single reinforcement is greatly enhanced by the 
secondary reinforcement deriving from it. Hence there is no direct cor- 
respondence between the number of responses yielded in extinction and 
the number of responses reinforced. The schedule of reinforcements is 
very important, with the greatest yields in the way of resistance to ex- 
tinction coming from intermittent reinforcement (interval or r 
forcement ). 

3. Motivation. In agreement with Thorndike,(reward is found to in- 
crease operant strength, while punishment has no corresponding weak- 
ening influence. Although there are several ways in which punishment 
enters, one is to create the variable called emotion, which reduces the rate 


of responding without reducing the number of res 
tinction. 


Drive level (determined by degree of deprivation) affects the r 
of responding, but its level does not influence the effectiveness of a r 
forcing stimulus in producing resistance to extinction. There is an unre- 


solved problem of a lower limit here, for there must be enough drive 
to maintain responding and to make the reinforcer relevant.5° 


atio rein- 


ponses yielded in ex- 


ate 
ein- 


58 Skinner (1953a), pages 194-203. 


°° We shall meet this same problem in the next chapter, when Hull’s interpretation of 
reinforcement is under consideration. 
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4, Understanding. The word “insight” does not occur in the indexes 
of Skinner’s books. Keller and Schoenfeld identify insight with rapid learn- 
ing, and then explain it on the basis of (1) similarity of the present prob- 
lem to one earlier solved, or (2) the simplicity of the problem. 

Because solving a problem means only the appearance of a solution in 
the form of a response, Skinner believes that the technique of problem- 
solving is merely that of manipulating variables which may lead to the 
emission of the response. No new factor of originality is involved. This 
makes it possible to teach a child to think.” 

5. Transfer. Skinner prefers to use the term “induction” for what is 


commonly called generalization in the literature of conditioning. Such 


induction is the basis of transfer. The reinforcement of a response in- 
creases the probability of all responses containing the same elements. 
the control acquired by a stimulus is shared by other stimuli 


Similarly, 
tion of transfer is, in spirit, very 


with common properties. This interpreta 


similar to Thorndike’s. 
6. Forgetting. There is no special theory of forgetting proposed, al- 


though the distinction between extinction and forgetting is carefully main- 
tained, Conditioning is long remembered. A pigeon showed sizable ex- 
tinction curves six years after the response had been last reinforced. 


Extinction, too, is retained. Spontaneous recovery does not mean that ex- 
» too, 1s 


tinction is forgotten, for successive extinctions show the results of earlier 


ones, 

True forgetting ap 
Other kinds of response 
tion or interference by incompa 
applies to so-called Freudian forgetting. 


pears to be a slow process of decay with time. 
decrement are better subsumed under extinc- 
tible responses. The latter interpretation 


The Reflex No Longer 4 Satisfactory Unit 
st committed himself to the reflex as his unit (Skinner, 
the correlation between a class of stimuli and a 
this correlation was lawful, he believed the term 
A case could be made for his position, but the 
consequence was no more than to discover as his unit what Thorndike 
had earlier named the “connection” or “bond.” Thorndike, too, accepted 
the correlation between situations and responses as the basis for his unit. 
When Skinner developed the notion of emitted behavior, his earlier 
unit was no longer appropriate. The discriminated operant, which is the 


When he fir 
1931), he referred to 
class of responses. If 
“reflex” to be appropriate. 


69 Keller and Schoenfeld (1950), page 60. 
81 Skinner (1953a), pages 252-256. 
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basis of experiment and theory in the newer writing, he early referred to 
as a pseudo-reflex.* He pointed out that it would be a great mistake to 
suppose that the correlation established between a discriminated stimulus 
and an operant followed the laws of the reflex. Thus the systematic cor- 
relation between stimuli and responses, a legitimate extension of the 
historical term “reflex,” was no longer appropriate in dealing with operants. 
The terms which enter a reflex relationship should be those of “stimulus” 
and “response.” Skinner's concern has appropriately shifted to mere law- 
fulness of behavior. The reflex proper (the basis for respondent behavior) 
is limited to autonomic and postural responses; the operant can just as 
well run under its own name, and not be confused with these reflexes. 
Instead of insisting on the analysis into reflex units, Skinner now appears 
somewhat less sure about the ultimate unit, and is temporarily satisfied 
with a functional analysis at the level of experimental convenience. 


Co-ordination with Other Systematic Viewpoints 


Skinner's “fresh start” approach to psychology has made it difficult 
for him to use the data collected by others, and, on principle, he rejects their 
concepts. His role in reference to the theories of others—insofar as he 
has paid any attention to their claims—has been chiefly that of a trenchant 
critic. He has felt no responsibility for the task of inter-investigator co- 
ordination. In his book Science and human behavior (1958), written 
as a textbook, he used no literature citations, and he mentions by name, 
among writers with some place in learning theory, only Thorndike, Pavlov, 
and Freud. 

That some co-ordination is possible is shown by Keller and Schoenfeld 
(1950), who made a serious effort to bridge the gap between the data col- 
lected by Skinner and his followers and by others interested in learning, 
They were able to find much that they could use, even though they re- 
mained within the conceptual structure of Skinner’s system, 

The almost exclusive preoccupation with animal experiments (except 
for the studies of verbal behavior, and work with abnormal human sub- 
jects thus far only sketchily reported) raises some interesting questions. 
Presumably a functional, descriptive psychology of this kind must ul- 
timately design experiments at the level at which scientific statements 
are to be made. As Verplanck points out, it is somewhat anomalous th 
a systematist who refuses to predict what a rat or pigeon will do—because 
such prediction does not belong in a scientific study of behavior—is willing 

® Skinner (1935). 


at 
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to make confident assertions about the most complex forms of human be- 
havior, economic, political, religious. The intervening steps appear to be 
missing, and the theory makes no formal provision for them. 


The Anti-Theory Argument 

Skinner's espousal of a descriptive, positivistic position and his op- 
position toa hypothetico-deductive system have frequently been reasserted, 
nowhere more clearly than in his presidential address before the Mid- 
western Psychological Association, entitled: “Are theories of learning 
necessary?” “* 

The theories he opposes are of at least two kinds, One kind is that 
which would find its explanations at a different level of discourse, using, 
for example, a physiological explanation for psychological events. There 
is no real objection to such a theory, unless it is overly insisted upon. The 
independence of the behavioral subject matter must be permitted, and 
explanatory mechanisms, especially explanatory fictions, are to be avoided. 
He challenges another kind of theory, commonly called hypothetico-deduc- 
tive, which stresses the formulation of hypotheses and their testing. This 
kind of theory Skinner objects to as a futile and extravagant exercise. The 
argument can be stated simply enough. The end result of scientific investi- 
gation, whether the investigation is deseniptive or an effort to confirm a 
theory, is a described functional relationship demonstrated in the data. 
Because the end result is the same, the only point at issue is really which 
method more efficiently directs the inquiry. Skinner is firmly of the opinion 
that theories are wasteful and misleading. They usually send the investi- 
gator down the wrong paths, and even if the scientific logic makes them 
self-correcting, the paths back are strewn with discarded theories. Once 
the data are in order, the theories tend to drop out. “If the theories have 
played no part in the design of the experiment, we need not be sorry to see 
them go.” i , , é 

Criticism of existing hypothetico-deductive aeons in psychology is 
easy. This does not mean that their scientific logic is wrong, It may mean 
hatsome particular efforts have been premature or misdirected. Perhaps 
Skinner is right in criticizing unripe theorizing, but surely he is wrong if 
he means that we should return to a Baconian conception of science. The 
power achieved by other sciences through theoretical formulation is too 
convincing for such formulation not to serve at least as a long-range goal 


63 Verplanck (1954), page 31l. 
6t Skinner (1950). 
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for those trying to create a science of behavior. That Skinner's approach is 
a fruitful one need not lead to the conclusion that other approaches are 
futile. 

One of the next steps for a science, after lawful relationships have 
been discovered and appreciable amounts of data collected, is to represent 
the data in a kind of shorthand Summary, using a minimal number of terms, 
and mathematical representation, This kind of systematic science Skinner 


can support, because it does not call for inferences alien to the data of 
observation. But he says of it: 


We do not seem to be ready for a theory in this sense. At the moment we 
make little use of empirical, let alone rational, equations. A few of the present 
curves could have been fairly closely fitted. But the most elementary preliminary 
survey shows that there are many relevant variables, and until their importance 
has been experimentally determined, an equation that allows for them will have 
so many arbitrary constants that a good fit will be a matter of course and cause 


for very little satisfaction,» 

It is not surprising that a r 
remark that Skinner's is “a hi 
theory.” 6s 


ecent critic has found it appropriate to 
ghly formal, but not a highly formalized, 


Success of the System 


Two major achievements give the system a firm pl 
psychology. 

At the level of data, the most novel and interesting data are those 
deriving from the various schedules of intermittent reinfore 
the sketchiest of experiments ha e 
fascinating feld,” and the kind of operant behavior st 
has lent itself beautifully to this kind of investigation, It is a far cry 
pointing out that rewards influence learning to 
trol whereby several thousands of responses are yielded at a uniform rate 
during extinction following a particular pattern of rewarding, 

At the level of practical behavior the most striking r 
obtained in animal training. While these h 
theoretical interest, it is not w: 
cleverness on the part of the trainers. These practic 


ace in contemporary 


from 
achieving the kind of con- 


thus far.(No other learning theorist has been able to train an animal before 
68 Skinner (1950), page 216. 
s8 Verplanck (1954), page 295. 


°* For a critical review of the literature, see Jenkins and Stanley ( 1950) 
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an audience in a prompt and predictable manne} while at the same time 
epitomizing the principles of his theory. There have long been public 
demonstrations of learned behavior—of conditioned responses, of maze 
learning, of discrimination—but these demonstrations have usually relied 
upon exhibiting the results of earlier training. By contrast, Skinner’s pigeons 
can be brought before a class and taught various “stunts” before the eyes 
of the students. The empirical demonstration that learning is under the 
experimenter’s control is important, and if what happens can be described 
in terms of the system, so much the better for the system. 

The practical use of the system is based on the complementary 
principles of control through presenting and withholding food reward for 
a hungry animal. The supplementary principles of stimulus discrimination 
entiation suffice to inaugurate the method of successive 
approximations. Beyond that, all that is needed is the experimenter’s in- 
genuity. It is not necessary to worry about anything precise in the way 
either of experimental data or of correlated principles. From the point 
of view of a theoretical achievement(this is really a pretty modest, ex- 
tension of Thorndike’s law of effect, Whether or not a child can be taught 
to write a limerick by the same methods can only be known when the 
limerick gets written. The theory does not propose to predict! 


and response differ 
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Intervening Variables and Their Anchorage in Observables 


In the first revision (1929) of his widely used textbook Woodworth 
suggested that we substitute the formula S-O-R for the earlier recom- 
mended S-R formula. The stimulus (S) affects the organism (O), and 


what happens as a consequence, the response (R), depends upon O as 


well as upon S. \Hull’s system may be thought of as a herculean elabora- 


tion of this S-O-R formula. \ 

In our experiments we measure envirenmental influences upon the 
organism (the input), and then measure: + tai---n S¢y’s responses (the 
output). These measures provide firm anchoywod at 


Joc a our data in the en- 
vironment, where objectivity can be achieved-and maintained. Input and 


output are not comprised exclusively of the experimentally studied stimuli 
and responses. Other influences upon the organism can be treated as ex- 
perimental variables, such as prior history of training, deprivation sched- 
ules, injection of drugs, and these influences can be described as objectively 
as stimuli and responses. What goes on within the organism we have to 
infer, and in the course of making these inferences we postulate certain 
intervening variables or symbolic constructs. If we tie these inferences 
firmly to the input-output terms by way of quantitative mathematical 
statements, we lose nothing in objectivity, and gain something in con- 


venience, understanding, and fertility of deducing new phenomena. This 
is the basic logic of the system. 


( Reinforcement the Primary Condition for Habit Formation 


È (In his choice amon 
reinforcement alone, or 


ggested that the Pavlov experiment 
nder Thorndike’s law of effect, and in 


principle.? 


l Reinforcement theory of the kind Hull espoused requires, in the 
specification of a primary reinforcing state of affairs, either d 


as in need satisfaction, or drive-s imulus reduction, as in the satisfaction 
of a craving rather than a need. (Hull earlier held to the drive 
theory, most clearly exemplified in escape from a continuing noxious 


1 Spence (1952a), page 646. 
2 Hull (1935a) (1937). 


rive reduction,” 


-reduction . 


y 
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stimulus, such as an electrically charged grid. The activity that terminates 

the noxious stimulus is reinforced because the need to escape injury is 

satisfied in the escape. Later, under the influence of the Miller-Dollard 
theory of drive-stimulus reduction,” Hull went over to the drive-stimulus 
reduction theory and abandoned the drive-reduction interpretation of 
reinforcement.\The example of escape from a noxious stimulus was equally 
cogent, though’now the reduction in pain (the consequence of stimulation) 
is theoretically the basis for reinforcement instead of the escape from in- 
jury (the underlying need) {In primary reinforcement the two events 
(drive and drive-stiy. Ins) + 10 &osely associated that it does not matter 
very much which iseed ind tu be reduced. But needs may require time 
for their satisfaction (as in the time required to digest food) whereas 
incentives (including food) work promptly as | reinforcers, more as stimuli 
might be expected to wor < Furthermore, as secondary reinforcement 
came into greater and greater prominence in reinforcement theories, the 
stimulus-reduction theory became even more attractive. This follows be- 

- cause secondary reinforcement is a function of stimuli rather than of needs. ) 


Integration of Behavior Sequences Through Anticipatory 


Responses 

Most of the primary behavioral laws in Hull’s system were derived 
strumental conditioning (both interpreted as 
illustrations of learning under the control of reinforcement). But the 
appeal of the system rests upon the comprehensive explanation that it 
provides of many phenomena of learning in experiments not classifiable as 
simple conditioning: more complex trial-and-error and discriminatory 
learning, maze learning, rote memorization, tool using, and so on. In order 
to make these further deductions, Hull proposed a number of intermediate 
mechanisms, derivable from the basic laws of his system, but, once derived, 
of very wide applicability. At this point the intent is merely to give some- 
thing of the flavor of these intermediaries; more detailed illustrations will be 


presented later in this chapter. ‘ 

Many of the stimuli present at the time the goal is reached are also 
present earlier. These include the stimuli from the drive (what Guthrie 
called maintaining stimuli), environmental stimuli present both earlier 
and during reinforcement, traces from earlier stimuli persisting to the 
goal, as well as stimuli aroused by the animal’s own movements. Hence 
in repeating a sequence of acts leading to a goal, as in running through a 
maze, there are always enough of these stimuli conditioned to the goal 


8 Miller and Dollard (1941). 


from either classical or in 


p; 
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response to elicit fractions of the goal response prior to reaching the goal. 
These fractional antedating goal-responses (ręs) are very important 
integrators in Hull’s system, and he made very ingenious use of them. 
While they were prominent all along in Guthrie’s system, Guthrie never did 
formalize their use as Hull did. 

The fractional anticipatory responses give rise to stimuli (sq). These 
stimuli sometimes serve as surrogates for directing ideas. In that case the 
fa is a “pure-stimulus act,” that is, an act that serves functionally merely 
to produce a stimulus that maintains a steering role in behavior.‘ 

These same stimuli (sg) are very important iu sepondary reinforce- 
ment, permitting reinforcement originating in f¢od at: the end of the maze 
to move back and reinforce turns far from the goal.° 

Furthermore, it is these same stimuli (sg) that generate the gradient 
of reinforcement and the habit-family hierarchy, important intermediate 
mechanisms, which we shall meet later on. 

Thus, on the basis of primary behavioral postulates, Hull deduced 
types of mechanism that led to wide generality in his deductions, 


The Requirements for a Quantitative, Deductive System 


Hull knew very clearly what he wanted in the way of a formal system. 
Such a system should begin with adequately defined terms, and then state 
a few (as few as possible) basic postulates. These postulates may either be 
very general empirical findings, and thus independently verifiable, or, if 
not directly testable, they must be subject to indirect verification. Their 
purpose is to relate the fundamental intervening vari 
(and quantifiable mathematical equations) to e 
anchorages in environmental events. These postulates, taken together with 
the definitions, will generate new testable deductions or predictions, These 
are the corollaries and the theorems of the system. 

The progress of science comes about through the experimental testing 
of theorems. When agreement is found, the postulate system generating 
the theorem remains in the running; when disagreement is found, a 
search is made for the faulty postulate, if such there be, and the postulate 
is then revised. Possibly, if no faulty postulate can be found, a new postu- 
late may have to be added. Thus the system is self-correcting, and only 


those features survive that have stood the scrutiny of meticulous experi- 
mental testing. 


ables by strict logic 
ach other and to their 


Systems of this kind begin as “miniature” ones. That is, they encom- 


4 Hull (1931). 
* Hull (1952a), page 14, 
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pass at first only a limited range of data. Even these limited data put the 
early system under strain, and the next steps consist in trying to secure a 
better fit between the theory and these data. Then additional data are 
incorporated through extensions of the system. Sometimes two or more 
“miniature systems” may be combined, as appropriate bridging concepts 
are developed. There may be major shifts in the systematic formulations, 
as when, in physics, Newtonian theory became a special case under 
Einstein’s more general theory. Thus theory construction proceeds by 
successive approximations, and finality in a theory is often a sign that 
the theory is faulty. 

Did Hull succeed in producing a good system, according to these 
criteria? A satisfactory answer may perhaps have to wait for the verdict 
of history. If a generally acceptable theory of learning eventually emerges 
with convincing evidence of its ancestry in Hull's earlier formulations, he 
will have succeeded. If the direction in which he moved turns out to have 
been a blind alley, and greater success comes from a fresh start in new 
directions, then his success will not have been as great. Even so he will 
have done a profoundly valuable service in showing what a mature system 
might look like, and in insisting that it was not too early to attempt to 


systematize psychology in rigorous fashion. 


THE FINAL SYSTEM 


Earlier Sets of Postulates 
Hull’s theoretical system evolved gradually. He habitually wrote down 


various conjectures and plans for experiments in bound notebooks, twenty- 
five volumes between 1915 and 1951.° Many of his later publications were 
foreshadowed in these notes and in the frequent mimeographed memo- 
randa that he circulated. His first paper addressed specifically to the 
subject matter of his final system was in 1929, entitled “A functional inter- 
pretation of the conditioned reflex.” In it he acknowledged his great in- 
debtedness to Pavlov and began on an informal basis to make the kinds of 
deductions he later sought to formalize. Another important influence upon 
his later theorizing arose out of a summer's teaching at Harvard in 1930, 
when he was invited to Jecture on aptitude testing because of the favorable 
reception of his recent book on that topic (1928). He there met C. I. Lewis 
and other philosophers, and asked them why philosophers had neglected 
his doctoral dissertation ON concept formation.’ Their answers are no 


6 Hull (1951), page 120. 
7 Hull (1920). 
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longer available to us, but in any case he added to his library, and read, $ 
Newton’s Principia, and Whitehead and Russell’s Principia Mathematica. 
These raised his sights as to the kind of theory to which he might aspire. 
The first formal system, using definitions, postulates, and theorems, ap- 
peared in 1935.8 It was what he called a “miniature system” concerned 
with rote learning, to be elaborately worked over later in the most detailed 
of Hull's formal systematic efforts, the collaborative book entitled Mathe- 
matico-deductive theoriy of rote learning.® 

The true ancestor of the final postulate set, however, was contained 
in Hull’s presidential address before the American Psychological Associa- 
tion at Dartmouth in 1936.!° This new “miniature system” was concerned 
with adaptive behavior of the kinds reflected in t 


he later postulate sets 
in Principles of behavior (1943), Essentials of behavior (1951),1* and 
A behavior system (1952a). The fi 


nal book was completed only a few 
weeks before Hull’s death and appeared posthumously. It was written 
while he was suffering from a heart condition and could work only a few 
hours a day. It is a tribute to his devotion to his work that it was finished at 


` all. Some of the changes in the latest version may very well turn out not 
to have been improvements, The system was by no means a finished one, 
and Hull was the first to acknowledge this. The promised books on in- 
dividual differences and on social behavior did not get written, and the 
final book is full of qualifications and reservations, If therefore we base 
the exposition on the final book, and have some faults to find 


not be taken as disparaging to Hull. The book c 
intended, and the details would have to 


case. Such was the kind of system Hull 


In the pages to follow we sh 
corollaries of the fina 


, this need 
arries the flavor that he 
be filled in and corrected in any 
envisaged. 


all trace through the postulates and 
stem one at a time, and then summarize the 


system, before turning to some of the deductive consequences of the 


system. There are a total of 17 postulates and 17 corollaries. An introduc- 


tion to these postulates and corollaries necessarily has to be somewhat 


8 Hull (1935a), 

9 Hull and others (1949). 

10 Hull (1937). 

* For an introduction to the postulate set of 1943, s the earli iti i 
book, Hilgard (1948), pages 80-91. » See the earlier edition of this 

12 This short book contains, with a few slight differences, the postulates in the final 
a oy the same postulates were listed in an earlier journal article, Hull 
a). 
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of the postulates and corollaries to their essentials through paraphrasing 
and eliminating some of the quantitative mathematical expression. 


The Sensitive and Responding Organism “ 


The organism, before it learns, brings with it certain tendencies to 
respond and the necessary sensory equipment to be affected by the en- 
vironment. Huli b first postulates acknowledge these starting points for a 


stimulus-response theory. 


Postulate 1. Unlearned stimulus-response connections 


Organisms at birth possess a hierarchy of need-terminating responses that 
are aroused under conditions of stimulation and drive. The responses activated 
by stimulation under conditions of need are not a random selection of the organ- 
ism’s responses, but are those most likely to terminate the need.18 


The first postulate recognizes a kind of biological trial and error 
favoring survival, based presumably on evolution. It actually plays little 


role in the system. 


Postulate 2. The molar stimulus trace and its stimulus equivalent 


A. Stimuli impinging upon @ receptor give rise to afferent neural impulses. 
These constitute a “self-propagating molar afferent trace impulse,” providing 
the equivalent of an increasing stimulus, reaching its maximum in about 450 
seconds, 5 j m 

B. A new phase follows the maximum. This subsiding phase of the molar 
afferent trace impulse gives rise to a second stimulus equivalent. The subsiding 
phase endures longer than the first (recruitment) phase. 

C. The intensity of the molar stimulus trace (s) (i-e., the effective stimulus 
in associative learning) is a logarithmic function of the molar stimulus equivalent 


of the trace." 

/the second postulate sets the stage for one of the asserted funda- 
mental conditions for associative learning: the contiguity between stimulus 
and response. As ‘with Guthrie, the contiguity that counts is inside the 
organism—in Hull's terms the coincidence between the molar stimulus 
trace and effector activity. Hence the second postulate is introduced to 


define the effective intensity of the process initiated when an external 
stimulus impinges upon å receptor. The argument leading to parts A and B 


13 Paraphrased from Hull (1952a), page 5; (1951), page 4; (1943), page 66. 
14 The three parts of this postulate are parap reet from, Hull (1952a), page 5; 
(1951), page 11. The earlier postulate, Hul (1943), page 47, has been considerabl: 
revised in order to make use of quantitative data from the experiments of Reynold, 
(1945a) and Kimble (1947). How he uses their data can be found in Hull (1952a), 
tions stated in quantitative form in Hull’s postulates 


pages 101-104. Because the equa’ A € 
are not essential to understanding the postulate set, the essence is here reported in verbal 


paraphrases. 
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of the postulate is as follows. If stimulus and response must be coincident 
in time for conditioning to occur, the most favorable interval between 
conditioned and unconditioned stimuli must be that at which the maximum 
after-effect of the conditioned stimulus coincides with the beginning of 
the unconditioned response. From the experiments of Kimble and of 
Reynolds this is known to be at an interval of about -450 seconds. Hence 
this interval appears in postulate 2A as the interval of maximum molar 
afferent trace impulse. The form of recruitment and of subsidence is the 
form of the curves obtained by Kimble and Reynolds with various intervals 
between conditioned and unconditioned stimuli. 

The introduction of the logarithmic term in 2C is a 
cession to psychophysics. The Weber-Fechner law tau 
a roughly logarithmic relationship between the sensory response and the 
physical stimulus. This logarithmic transformation may conceivably take 
place within the receptor, within the afferent processes, or within the 
cortex. Hull has rather casually assigned the transformation to the afferent 
process—the molar stimulus trace bearing a logarithmic relationship to 
the molar stimulus equivalent of the trace.® 


pparently a con- 
ght us to expect 


( Motitation and Reinforcement \ 


We now come close to the heart of the theory, which is unified 
the conception that learnin 


forcement. 


around 
g takes place only as a consequence of rein- 


Postulate 3{ Primary reinforcement 


When a response (R) is closely associated with a stimulus trace 
this stimulus-response conjunction is associated with 
produced stimuli (Sp) 
stimulus trace (s) to ev 


The rapid decre 


(s) and 
a rapid decrease in drive- 
there will result an increase in the tendency for that 
oke that response (R). - 
ase in the goal stimulus (sq) is also reinforcing..5 ` 
We may note that the a: 


ssociation which gets strengthened is alike 
for both Guthrie and Hull: 


a contiguous stimulus-response connection, 


ethaps classify Hull 
psychophysics, as contrasted with Fechner’s “inner” 
psychophysics, See Dodge (19 


ween decrease in drive (D) 
e postulate on primary rein- 
possibility that even when 
atisfied but some derivative 
use 7c and sa are central to 
in considering the system as a 
See the discussion of Corollary 


such as taste or appetite. Beca 
secondar uld probably be wise, 
whole, t Sa in Postulate 3. 
xv, page 148, 


I 


( Corollary ii. Secondar' 
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But for Guthrie it gets fixed merely because it occurs; for Hull it gets 
strengthened only because it occurs in association with reinforcement. X 
Primary reinforcement is brought about, according to the postulate, by 
the decrease in the stimuli produced by a drive, for example, the removal 
of a noxious stimulus, the reduction of hunger pangs (or by a decrease 
of stimuli associated with the anticipated satisfaction of these drives). Hull 
notes that he has vacillated somewhat in his earlier discussions between 
a reduction in need or drive (D), and a reduction in drive stimuli (Sp). 
He ends up with the latter, but with some reservations, and, in fact, leaves 


the matter open for revision.” 


Corollary i. Secondary drive 

When a neutral stimulus trace (s) has been closely associated with the 
evocation and rapid decrease of drive-produced stimuli (Sp), the hitherto neu- 
tral stimulus trace (s)) acquires a tendency to bring about these drive stimuli 
(Sp), so that the previously neutral stimulus trace (s) becomes the occasion for 


a secondary drive (s — Sp) 

This corollary expresses an association between two stimuli: the 
neutral stimulus trace (s), and the drive stimulus (Sp). Such an association 
is foreign to Hull’s system, for learning must always involve a response 
term, Actually a response intermediary is implied, even though not stated. 
This may be illustrated by the acquisition of fear as a secondary drive. 
The primary drive is pain, producing the original drive stimulus (Sp). 
Neutral stimuli associated with pain give rise to “fear” responses, very 
similar to responses to pain, and the proprioceptive consequences of these 
learned responses produce the drive stimulus (Sp) that serves the second- 
ary drive.!° The corollary, however, asserts nothing about this response 


intermediary. 
y rei nforcement 


utral stimulus trace (s) has been closely associated with a rapid 


(When a ne eB r 
ve produced stimuli (Sp), the hitherto neutral stimulus trace 


diminution in dri Q 
(s) acquires a tendency to bring 
neutral stimulus trace (s) acquires 


e seeming formality of his exposition, Hull used many 
hen stating his postulates, so that precise meanings 
nfer. I have restated the two corollaries in closely 
do. Then it becomes evident that a neutral 


“the power of acting as a reinforcing agent.” *° 


Despite th 
variations in wording W. 
are sometimes hard to i 
parallel form, which he did not 


17 Hull (1952a), page 153. P 
2B — a Fal (1952a), page 6; (1951), page 25; (1943), page 98. 


19 N, E. Miller (19482) (1951b). a 
20 Paraphrased from Hull (1952a), page 6; (1951 ), page 28; (1943), page 95. 


about the reduction of Sp, so that the previously |. ~ 
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stimulus trace associated consistently with a reinforcing state of affairs 
acquires two functions (and both at once!): (1) the power to arouse a 
secondary drive (Corollary i), and (2) the power to reduce drive stimuli, 
and hence act as a secondary reinforcing agent (Corollary ii). 

The distinction between secondary drive and secondary reinforce- 
ment is an important one, and practical examples can be found to make 
the distinction clear. For example, fear and anxiety easily classify as 
secondary drives, and their reduction is reinforcing. Similarly, distinctive 
food-boxes, or poker-chips used as tokens, readily classify as secondary 
reinforcing agents. A chimpanzee will work for a token only when hungry: 
the drive remains the usual hunger drive, but the secondary reinforcing 
agent is not itself hunger-satisfying. Hull intended to preserve this distinc- 
tion in the corollaries, but the formal statements do not specify when the 
neutral stimuli will evoke Sp (then serving as secondary drives), or 
when they will reduce Sp (then serving as secondary reinforcers), The 
only hint of a difference is that for the acquiring of a secondary drive 
the neutral stimulus must be present at the time of drive evocation as 
well as drive reduction, while, for the acquiring of secondary reinforce- 
ment, the neutral stimulus has to be present only at the time of drive re- 
duction. 

The attempt to interpret the actual events in secondary reinforcement 
came late in Hull's theorizing, and he probably had not developed a 
theory fully satisfactory to himself. That he was attempting to hold to a 
consistent drive-stimulus reduction theory of secondary reinforcement (as 
well as of primary reinforcement), is clear from the following statement: 

It follows that any stimulus consistent] 
situation will through that association ac 
tioned inhibition, i.e., a reduction in stim 


the resulting reinforcement. Since this 
quired through learning, 


y associated with a reinforcement 
quire the power of evoking the condi- 
ulus intensity, and so of itself producing 
indirect power of reinforcement js ac- 
it is called secondary reinforcement?! 

It can be pointed out that all conditi 
secondary reinforcing powers because th 
with a reinforcing state of affairs, 

While as paraphrased the corollaries deal only with secondary drives 
and reinforcements, they are applicable also to higher-order derivatives, 
so that tertiary drives can be built u 


Pon secondary ones, and higher- 
reinforcements can be based upon higher-order drives, 


oned stimuli must develop some 
ey must necessarily be associated 


order 


21 Hull (1951), pages 27-28. 
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\ 
\ 


The Law of Habit Formation ) 

Having chosen contiguous association under reinforcement as the 
basic qualitative condition for habit strengthening, Hull had an additional 
basic choice to make regarding the quantitative influence of reinforcement. 
Either reinforcement could contribute its maximum influence on one trial 
(the possibility accepted as a genuine one by Skinner, and corresponding 
to Cuthrie’s one-trial learning), Or reinforcement could add an increment 
to habit strength on each occurrence. Hull chose the latter alternative, 


\Postulate 4, Habit strength (sHp) as a function of reinforcement 


Habit strength (the tendency for a stimulus trace to evoke an associated 
response) increases a5.a positive growth function of the number of trials, pro- 
vided that trials are” evenly spaced, reinforcement occurs on every trial, and 


everything else remains constant.** 


The function described as a positive growth function is of the form: 


son. = 1— 10-8 


where N is the number of trials from absolute zero, and q is-an empirical 
constant. of the order of (.03)as stated in the original postulate. Such a 


function is used throughout Hull’s quantitative formulations. It can be 
unction with a maximum at 1.00, this maximum being 


a manner that each unit of reinforcement increases 
tant fraction of the possible habit strength as yet 
a curve of decreasing gains. Because the 
f habit was originally defined as a hab, a 


understood as a f 
approached in such 
habit strength by a cons 
unrealized. This is, of course, 
total range is 1,00, the unit o 


centigrade unit extending by steps of .01 to 1.00.” . 
While the postulate shows habit strength to be a function solely of 


the number of reinforcements, we cannot tell from this postulate alone 
what other influences there may be upon habit strength. Because it is 
specified that everything else is constant, it might be that habit strength 
would be different with a given number of reinforcements if, say, drive 
ticipate a little, however, and point out that 


were different. We may an 
there are no systematic influences upon sH other than the number of re- 
inforced trials. It would actually have been unnecessary to state in the 


postulate that everything else is constant, for nothing else can affect sHr 


directly.** 
51), page 82; (1948), page 114. 


Hull (1952a), page 6; (195 
is 38 The hab is dropped in Hull (1952a), and sHr becomes a 


t any units. 
etween 1951 and 1952 is the dropping of the in- 


t changes b a t 
Jus intensity during learning. See footnote 28, 


22 Paraphrased from 
28 Hull (1951), pag 
multiplying factor withow' 
24 One of the importan 
fluence upon sHx of stimu 
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This is a radical departure from the earlier system, because now 
most of the influences, as we shall see, are upon reaction potential (sEr) 
rather than upon habit strength (sHn). The distinction between per- 
formance and learning, so long insisted upon by Tolman, was formally 
accepted in the 1943 postulates, but by now has greatly reduced the 
quantitative influence of reinforcement upon associative learning. We 
would expect the magnitude of each reinforcement to be pertinent in 
determining the contribution of reinforcement to habit strength, but we 


are now told that this is not the case, so long as some unspecified minimum 
of reduction in Sp occurs. 


Non-associative Factors Affecting Reaction Potential 
Reaction potential (sER) is Hull’s term for the inferred process close 
to response evocation. It can be inferred onl 
to be identified with response. Reaction potential and response are not 
identical because reaction potential may be below threshold, hence not 
lead to response, or it may interact with competing tendencies 
incompletely revealed in response, 

| We may conveniently think of sHp as signifying associative strength 
and of .E, as signifying the strength of the tendency to respond, At a given 
level of habit strength (sHx) the magnitude of the reaction potential (sER) 
depends in addition upon three chief non-associa 
at the time of response evocation. These are spe 
postulates. 


/ 


and hence be 


tive * factors operative 
cified in the next three 


Postulate 5. Primary drive (D) 


A. A primary drive (at least that resultin 
sists of two components; (1) the drive 
of hours of food deprivation, and 
drive as starvation continues, 

B. Each drive condition generates 
which is an increasing function of the driv 


C. Some drive conditions may motivate into action habits set up on the basis 
of different drive conditions,26 


g from food deprivation 
proper, which increases with the n 
(2) an inanition component which r: 


) con- 
umber 
educes: 


a characteristic drive 


stimulus (§ 
e condition. (Sp) 


According to Spence, drive (D) 
state or general condition of the ne 
needs contribute.” *? Hence the ge 


is conceived by Hull as “a nonspecific 
Tvous system to which all the specific 
neral drive level is due not only to the 


25 Spence (1951a). 
26 Paraphrased from Hull (1952a) Pages 6-7; (1951) 2 
, i > pages 28-40; (1943) 
253. The postulate as stated by Hull has four arts, one of which is į page 
illustrative purposes only. It describes the E li A introduced for 
experiment (Yamaguchi, 1951), and is omi 


ctl as measured in one 
pe ed here as unnecessar: 
“7 Spence ( 195la), page 249. y tatie er 


y from response, but is not ~ 
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relevant drive but to all needs coexisting at the time. The drive stimulus 
(Sp) is more nearly specific to a given drive, so that it can provide for 
discrimination between drives as well as for reinforcement. There is some 
interchangeability among drives as acknowledged in the final part of the 


postulate. 
Drive activates habit strength into reaction potential. The simplest 


form of statement is: 
sEr =D X sHr 


This means that a habit of given strength will yield responses of 
greater or less magnitude, depending upon the level of drive operating 
at the time the response is evoked. For the sake of simplicity, this and 
the two following postulates assume that the levels of the non-associative 
factors have remained the same during learning and testing, in order to 
avoid the complications that ensue when a different level of drive is used 
in learning and in testing. This assumption makes it possible to provide 
quantitative expressions for the postulates with greater ease. The intent 
that drive strength has its true influence at the time of 


is, however, 
is, that it is a non-associative factor. 


response evocation, that : 
The concept of drive is very important in Hull’s system. It has three 

distinct functions: 
1. Without drive there could be no primary reinforcement, because 
primary reinforcement requires the rapid diminution of Sp. There could, 
ary reinforcement either, for secondary reinforce- 


of course, be no second 
ment originates in the association of a stimulus with primary reinforce- 


ment, 

2. With 
strength into reaction potential, Dri 
sHr ` 


out drive there could be no response, for drive activates habit 
ve is the most important multiplier of 


3. Without the distinctiveness of the drive stimuli Sp there could 
be no regulation of habits by the need state of the organism, ‘no way to 
learn to go one place for water when thirsty, another place for food when 
hungry. ; ; ao 
The first and third of these functions together determine which in- 

‘ell as why it is reinforcing; the second function 


centive is reinforcing, as W ó á : i 
is that of a goad to action in the direction of need satisfaction; the third 


i i iscriminati eering role. 
also gives drive a discriminative or st g 


y dynamism (V) 
f the stimulus, the greater the reaction potential 
ength.\The magnitude of the stimulus-intensity 


Postulate 6, Stimulus-intensit 


The greater the intensity © 
for any given level of habit str 
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dynamism (V), as a component of reaction potential, is an increasing logarithmic 
function of the stimulus intensity. 

The basic equation parallels that for drive: 


sEr = V X sHp 
Postulate 7. Incentive motivation (K) 


The greater the magnitude of the incentive used in reinforcement, the 
greater the reaction potential for any given level of habit strength. The incentive 
motivation (K), as a component of reaction potential, is a negatively accel- 
erated increasing function of the weight of food or quantity of other incentive.2" 


Again it must be noted that the influence of amount of the 
agent during learning is upon reaction 
strength: 


reinforcing 
potential and not upon habit 


sEp = K x sHp 
We are dealing here with constant values of K within tr: 
testing, so that the question of shift of incentive between le: 
testing (as in latent learning experiments) must wait for further discus- 


sion. So, too, the problem of the influence of the perceived incentive, as 
part of response evocation, will be discussed later. 


aining and 
arning and 


Postulate 8. The constitution of reaction potential (sEp) 


When conditions have been constant throughout learning and response- 
evocation, the reaction potential (sEr) is determined by the habit strength 
(sHy) multiplied by (1) the drive (D), (2) the stimulus-intensity dynamism 
(V), and (3) the incentive motivation (K) 80 

The general equation becomes: 


sER =D xX VX Kx Hp 
This postul 


into one equation. Several statements need 


28 Paraphrased from Hull (1952a 
appear in Hull (1943), A chan 


), page 7; (1951), page 46. The concept does not 
1952 version V:, the V mean 


1951 and 1952, in that in the 
29 Paraphrased from Hull 


2 ations i 
acs ie are considerably altered from Hull (1943), 
*1 Hull (1951), page 58, states 


& that the maximum hunger dri bi i 
Fem its 12 oe ee ey Sates g ve ranges “uncertainly 
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is measured in the units of sEx. The other multiplying terms are merely 
decimal multipliers or weights, without units, each with a maximum of 1.00. 
Were sHn, V, and K all at the maximum, sEx would be equal to D. Any 
other values of sHr, V, and K make sEx less than D. A zero value for any 
multiplier (including D) makes sEp assume a value of zero. We shall re- 
turn later to a discussion of the measurement of sEx. For the present, we 
may note that the forms of the functions underlying sHn, D, V, and K 
have some degree of independent determination, but their units are en- 
V, and K the arbitrary maximum is 1.00, and 
they are pure numbers, without any kind of unit. Both the maximum value 
and the units of D are obtained by working backwards from sEr. Had D 
been treated symmetrically with sHp (as it was when the unit called mote 
was assigned, paralleling hab) ® then any one of the other terms could 
just as well have taken on the larger range of values. It is important to 
note how a tour de force of this kind makes possible the multiplication of 
four quite disparate quantities without causing the systematist major 
difficulties, A preferable solution (and an equally simple one) would 
have been to preserve the uniform (arbitrary) scale for each of the factors, 
and then to introduce an arbitrary constant to convert to the scale used for 
sEr. This would have made no difference in the end, and would have 
avoided the illusion that D was assigned more importance than the others 
on the basis of some kind of empirical evidence. It turns out to be merely 
the residual legatee of the constant required to convert to the units of 


sEr. 
Corollary iii. Delay in reinforcement (J) 
A. For a response in a chain, the greater the delay in reinforcement, the 


i i i hat response. 
weaker the reaction potential leading to t sens) 
B. For a single F SpOnse, the greater the delay in reinforcement, the weaker 


the react jal, according to a gradient that falls off rapidly at first and 
thes GEE vpo approaching She asymptote at about 5 seconds.** 

The delay-in-reinforcement gradient has been very important in Hull’s 
theorizing ever since his early paper on the goal gradient.** The simple 
delay-in-reinforcement gradient (Part B of Corollary iii) became shorter 


3, At that time the decimal was omitted, so that both sHr 
of 100. 


tirely arbitrary. For sHr, 


32 Hull (1943), page 24 


and D had maximum values ‘ : 
BS Pergnhinced: frorn Hull (1952a), pages 7-8. The equation stated by Hull is based 


Ea ination, (1952a), pages 126-133. Hull himself had serious Á 
Sati eee ae assumptions involved, (1952a), page 181 (footnote), page 
183, The status as a corollary is @ change from (1951), page 56, where J was a multi- 
plier equivalent D, V and K. It was then a modification from (1943), pages 145, 178. 
34 Hull (1932}. 
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as Hull's system evolved. The earlier gradient for a single reinforced 
response extended for 30 to 60 seconds, while the final one extends but 
5 seconds." 

The shortening of the gradient came about as secondary reinforce- 
ment gained more prominence in the generation of the longer gradients. 
By placing the gradient in a corollary rather than a postulate, Hull ac- 
cepted Spence’s suggestion that the primary gradient is probably absent 
altogether, all gradients being generated through secondary reinforcement 
or other intermediate mechanisms.** 


Corollary iiiA has the logical status of a theorem, extending the 
gradient to cover links in a chain of responses, 


Behavioral Summation and Withdrawal 


The next four corollaries deal with a complication of habit strength 
and reaction potential when two stimuli (along a continuum) have been 
associated with the same response through reinforcement. 


Corollary iv. The summation of habit strengths when two stimuli S and S’ 
been separately associated with a response through reinforcement 

Corollary v. The summation of reaction potentials when two stimuli S and S’ 
have been separately associated with a response through reinforcement 

Corollary vi. The withdrawal of habit strength when S’ is withdrawn from the 

combination of S and S’ as in Corollary iv 

Corollary vii. The withdrawal of reaction potential when S’ 


the combination of S and 8’ as in Corollary v 37 


have 


is withdrawn from 


These four corollaries all revolve about the same b 
The combined habit strength is said to be composed of (1) the primary 
habit strength (sHr) to the stimulus more frequently followed by a rein- 
forced response, and (2) the additional habit strength to this stimulus 
(sFr) generalized from the habit strength acquired by the second stimulus 
(s'Hr). The sum is less than that obtained by direct addition of these 
two components. To compute the sum you may first convert both of these 
habit strengths (sHy and sHr) to the number of reinforcements that 
must follow response to S in order to reach these strengths. Add these 
two N’s and from this sum compute the equivalent habit strength. This 
will be the combined habit strength you seek. An algebraic simplification 


asic assumption, 


* For the earlier gradient, see Hull (1943), page 145, for the later one, Hull (1952a) 
page 131. á 
36 Spence (1947), 


81 Corollaries iv through vii appear in Hull (1952a), pages 8-9, and i 
Pages 63, 65, 67, and 69. The earlier discussion appeared in (1943), page ZA lines 
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reduces the computational complexity and permits computation directly 
from the habit strengths: 


sHr + sHy = sHr + sH, — sH, X sHp (Corollary iv) 
The same argument applies to the computation of the combined 
reaction potential, and the steps are reversed in the withdrawal of habit 


strength and reaction potential. 


Corollary viii. Behavioral summation of incentive substances °8 


The corollary as Hull states it indicates a method of determining 
the relative weights to be assigned different incentive substances (e.g., 
water and food) in order to compute a combined value of K, the incentive 
reinforcement, when both substances are used together. 


Inhibition 


The fundament 
is represented by experime 


al form of inhibition in most conditioning theories 
ntal extinction—the reduction in response with 
repeated non-reinforcement. One interpretation is that extinction is a result 
of competing response tendencies. This interpretation requires no new 
principle, and no negative response tendencies have to be computed. It 
is the position we have met in the views of both Guthrie and Skinner. The 


second interpretation calls for a special theory of inhibition, producing a 


negative reaction potential. Hull adopts this second kind of theory. The 
empirical and theoretical background for Hull’s theory rests on Hovland’s 


earlier recognition of inhibition-of-reinforcement,” on Neal Miller’s analy- 


sis of the role of muscle strain, fatigue, and pain in extinction and spon- 
d in “anticipatory relaxation,” *° and on Mowrer’s 


taneous recovery, aN 
effortfulness of task in relation to experimental ex- 


experimental test of 


tinction.** 
With this background, Hull suggests/ that two kinds of inhibitory 
potential compete with excitatory potential to weaken it: reactive inhibi- 
litioned inhibition (sIr). These both act as negative 


tion (Ip) and conc 
reaction potentials. They summate to produce a total inhibitory potential 
e reaction potential.’ 


that can neutralize positiv 
(Reactive inhibition (In) is generated whenever a response occurs, and 
pas 
38 The corollary is stated in mathematical form in Hull (1952a), page 9; (1951), 
page 72. y i infi ; i 
59 _ Sometimes continued reinforcement leads to a decline in 
Hovind (1070). al extinction, e.g, Hilgard (1933). 


i çperiment: 
tesponse resemblo e To4T), pages 40-43; Miler CARH) puges 451-452. 


41 Mowrer and Jones (1943). 


138 THEORIES OF LEARNING 


is akin to tissue injury, fatigue, or pain. As such, it acts as a barrier to 
repetition, directly inhibiting reaction potential. It has a second aspect in 
addition, in that it acts (just as pain does) as a drive, strengthening any 
activity associated with its reduction.** The activity that reduces Ir is rest 


. (‘non-activity”), and it is this conditioned non-activity that is known as 
conditioned inhibition (sIr). 


Postulate 9. Inhibitory potential 


A. The occurrence of a response produces reactive inhibition (Iņ) which 
both inhibits reaction potential and acts as a negative drive. 


B. Reactive inhibition (Ig) dissipates spontaneously with the passage of 
time as a simple decay function of elapsed time. 
C. As a given response is repeated, increments of reactive inhibition sum- 


mate. The resulting Ip also summates with conditioned inhibition (sIn) to pro- 
duce the aggregate inhibitory potential (t 


Jis 
D. When non-reinforced responses follow each other at short intervals, the 
aggregate inhibitory potential (1p) increases as a positive growth function of 


the number of non-reinforced trials, thus yielding the phenomena of experi- 
mental extinction. 


E. Because the aggregate inhibitory potential (1,) increases with the 
magnitude of the work involved in each response, the greater the work involved 


the fewer the number of unreinforced responses to a criterion of experimental 
extinction.*# 


Portions of this postulate (especially parts C, D, and E) are intimately 
related to the three following corollaries, 


Corollary ix. Conditioned inhibition 


Stimulus traces closely associated with the cessation of a response, in the 
presence of Ig (which is assumed to decrease also when the response ceases) 
become conditioned to the particular non-activity. This conditioned non-activit 
is known as conditioned inhibition (sIņ), and opposes reaction potential (Ep). 
The amount of sIp generated increases with the amount of Ip present.*4 


The systematic considerations here are actually very complex, and 
they are treated rather sketchily by Hull. Becaus 
to have two components, the equivalents of 


treats only the reaction potential ( 
tion (sIn) 


e sIr is a habit, it ought 
sHr and sEr. Actually, Hull 
sEr) equivalent. Conditioned inhibi- 
is said to be a function of the amount of Ip because the more 
Ip there is the more rapidly it can diminish and hence the m 


E m ore reinforcing 
it can be.** This argument reflects outmoded features of the 


system. A form 


42 Hull did not follow up all the implications of treating I i i 
subtract it from D, which as a matter of course his a yautet ae ‘t 
in eee with D as a multiplier in the determination o s 

48 The five parts of Postulate 9 are paraphrased from Hull s 

1952a), pages 9-10; 
(1951), pages 74-81. Antecedents occur in (1943) canis ( Arg i 
44 Paraphrased from Hull 4 er mally paga 800. 


(1952a), page 10; (195 $ 
48 Hull (1951), page 75; Kimble (1949), ae 22, si? Uaa MES n00; 
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of the theory more consistent with the current system would simply make 
a stronger drive (Ir) a larger multiplier, with the same result—a larger 
sIn resulting from larger values of In. 

In Corollary ix, conditioned inhibition is derived from the principle 
of reinforcement (Postulate 3) by interpreting inactivity as a response and 
Ip as a drive. Hence it is legitimately a corollary rather than a postulate, 
although Hull is not always clear about his basis for classifying the several 


principles. The corollary is necessary to provide a definition of conditioned 


inhibition (sIn) already made use of in Postulate 9C. 


Corollary x. Aggregate inhibitory potential as a function of work 


The aggregate inhibitory potential that is generated through complete ex- 
tinction requiring a fixed number of trials is an increasing function of the 
magnitude of the work involved in each response.*® 
Corollary xi. Aggregate inhibitory potential as a function of the number of re- 

sponses 

The aggregate inhibitory po! 
tinction requiring a fixed magnitu 
tion of the number of trials.*7 

Corollary x follows from Postulate 9E, but restates the postulate for 
a constant number of trials to complete extinction, instead of a constant 
reaction potential. A good guess as to the reason for this and the following 


corollary is that scaled data were available from experiments based on 


tential that is generated through complete ex- 
de of work per response is an increasing func- 


responses to extinction.** 
Corollary xi bears a corresponding relation to Postulate 9D. The 


postulate states the number of trials to extinction when reaction potential 
is constant and the work per response varies, while the corollary states 
the number of trials to extinction when work is constant and reaction 
potential varies. This implication of the corollary is not self-evident, until 
we remember that the total amount of inhibition generated during ex- 
tinction must equal the total amount of reaction potential that it must 
neutralize. Hence with constant work, the only independent variable im- 
plied in Corollary xi is the amount of reaction potential to be overcome 


through the generation of inhibition during extinction. 


ce and Interaction of Stimuli 
o two postulates concerned with stimulus interrela- 
when one stimulus is to some degree 


Equivalen 


We now come t 
tionships, the first with generalization, 
46 Paraphrased from Hull (1952a), page 10; (1951), page 84; (1943), page 279. 


47 Paraphrased from Hull (1952a), page 10; (1951), page 85. 
48 The ‘experiments are those of Mowrer and Jones (1943), Perin (1942), and 


Williams (1938). 
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equivalent to another, the second with afferent stimulus interaction when 
two or more stimuli are active at once. The earlier corollaries that dealt 
with interaction effects (Corollaries iv through viii) considered the sum- 
mation and withdrawal of reaction potential, once habits have been set 
up. Here the concern is with the properties of stimuli and with the changes 
in stimulus traces when more than one stimulus is active at once. 


Postulate 10. Stimulus generalization 


A. The generalized habit strength (s,H,,) associated with a stimulus trace 
s, differing in quality from the stimulus trace s, involved in the habit s,Hp) 
depends upon the remoteness of s, from s, on a qualitative continuum in units of 
discrimination thresholds (j.n.d.’s). 

Provided D, K, and V remain constant, the 
(s,Ep) will vary directly with s,Hp. 

~ B. The generalized habit strength (s,H,) associated with a stimulus-trace 
(s,) differing in intensity from the one (s,) involved in the prior leamed habit 
(s,Hy) likewise depends upon the remoteness of s, from s,. Because the differ- 
ence is one of intensity, log units may be used instead of j.n.d.’s in determining 
the stimulus separation. The amount of generalization, furthermore, is a function 
of V,, the stimulus-intensity dynamism of 8, during conditioning. 

Provided D and K remain constant, the generalized reaction potential 
(s,E,) will vary directly with s,Fty and V., the stimulus-intensity dynamism of 
S, during testing. 

C. Generalized conditioned inhibition (s,In) follows the same principles 
of generalization from s,Iņ along qualitative and quantitative stimulus continua 
as generalized habit strength and generalized reaction potential,5° 


generalized reaction potential 


The postulate means essentially that the more alike two stimuli 
the more nearly one can substitute for the other in arousing conditio 
responses. The differences between stimuli along a dime 
description of a gradient of generalization along this dimension. Such a 
gradient is implied in each of the three parts of the postulate. A number 
of closely related attempts have been made to derive the form of the 


gradient from empirical data, and to apply the results to problems of 
stimulus equivalence and to discrimination," 


are, 
ned 


nsion permit 


Corollary xii. The generalization o 


f habit strength and reaction potential with 
change in strength of drive 


When a response is learned under one intensity of drive and tested under 
another, both generalized habit strength (,H,) and generalized reaction poten- 
49 This use of V: appears to be a “hangover” from the 1951 postulate set. This is the 
only place in the final postulates where habit strength is any longer a function of 
stimulus intensity. See footnote 28. 


5 The postulate in its three parts is paraphrased from Hull 
(1951), pages 88-92 (with minor systematic differences); (1943), pages 199, 264. 
faas ge pee aga) (1939) (1943); Hull (1939a) (1947b) (1949) 

; “or related papers of his own, and a critical ji hi 7 
formulations, see Roast 71949). R VEVE 9 Mhie: Hull Spence 


(1952a), pages 10-11; 


i 
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tial (,E,,) will be decreased, their strengths following a gradient based upon the 
amount of difference in drive strength during learning and during testing.®? 
The meaning of this corollary is not clear without some explanation. 
Drive-stimulus habit strength (s He) is always a component of the total 
habit strength resulting from learning. (We have not met this in exactly 
this form before, but Sp is always present with the response that is being 
reinforced, and so enters into association with it.) Hence under a new 
drive intensity, the total generalized habit strength will have as a com- 
ponent a reduced generalized drive-stimulus habit strength (s An). The 
reduction follows from Postulate 10, the new drive stimulus being at some 
e original one. It follows that as a consequence of the 
mulus, the resulting total generalized habit strength 
Il be less under a new drive intensity than under 
amount less depending on how far apart 


distance from th 
change in the drive sti 
or reaction potential wi 
the original drive conditions, the 
the drive conditions of learning and testing happen to be. 

It comes as something of a surprise to learn that D, as a multiplier 
of habit strength, holds during learning and testing when D remains con- 
stant, but at the time of testing a change in D always reduces reaction 
potential, even though D is increased. This is all the more surprising be- 
cause the other multipliers (V and K) continue to operate as multipliers 
with changed intensity, increasing reaction potential as they increase, 
and decreasing reaction potential as they decrease. 

The empirical data which give general support to the corollary are 
those of Yamaguchi (1952). He found that with starvation periods ranging 
from 3 to 72 hours, with tests at various in-between periods of food 
deprivation, tested responses tended to be at minimum latency when the 
testing and training drives were alike. When tested at other intervals of 
deprivation, either greater Or less than the training interval, responses 
tended to be of longer latency. These results are in agreement with the 
corollary, or, to put it the other way, these results gave rise to the corol- 
lary. 

A related experiment by Strassburger (1950), perhaps unfamiliar to 
Hull, yielded results conforming better to what one would expect from 
Hull's general postulate system, but disagreeing with this corollary. After 
training under various conditions of food deprivation from 1⁄4 to 47 hours, 
then testing under a uniform deprivation of 23 hours, he found no dif- 
ferences in the number of responses to extinction that could be attributed 
to differences in the drive level at the time of original learning, but he did 


5 52 Paraphrased from Hull (1952a), page 11; (1951), 
236. 


page 90; (1943), pages 235- 
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find differences attributable to the number of prior reinforcements. This 
is just what Hull’s Postulate 4 would lead us to expect. It is not consistent 
with Corollary xii, however, because change of drive at time of testing 
did not of itself reduce responsiveness. 


Postulate 11. Afferent stimulus interaction in external inhibition 


When a response has been conditioned to a stimulus (S,) and one or more 
previously neutral stimuli (S,,S,, . . .) are presented along with S,, the afferent 
impulses from this combination of stimuli interact to yield a new molar impulse 
(8). This new impulse is equivalent to a stimulus at a greater or less distance from 
S, on a qualitative continuum. The resulting generalized reaction potential to š 


will be smaller than that to s,, depending upon their remoteness from each 
other.** 


` The equation defining the amount of reduction in sEp is: 


Er 
be E 
d (in j.n.d’s) = R 


where „Ep is the original reaction potential, ;E, the reaction potential reduced 
by external inhibition because of the presence of the added stimuli. 


, and j a con- 
stant determined by the form of the gradient of generalization. 


It is clear that the equation attempts to quantify the effects, while 
asserting nothing at all about the kind of patterning or other relationship 
among stimuli responsible for these effects. Furthermore, as presented it 
quantifies merely the limited form of interaction found in external in- 
hibition, and does not cover the many other varieties of stimulus interac- 
tion. Parallel equations could have been developed covering some of the 
other matters that Hull discussed less formally, and he intended that a 
research program would determine the circumstances producing the kinds 
of changes he was ready to quantify. 

This postulate permitted Hull to acknowled 
terned stimuli—the essence of the gestalt perception problem. He be- 
lieved that in this postulate he had achieved a rapprochement with gestalt 
psychology, a sign of the maturing of psychological science, 


ge the properties of pat- 


Random Variability of Reaction Potential 


Organisms vary from moment to moment in their abil 
well-established habits. In memorizing nonsense syllables, 
rectly anticipated on one trial may be missed on the next tr 
threshold is never at a fixed value. The threshold is instead 
of central tendency, 


ity to perform 
a syllable cor- 
ial. A stimulus 


some measure 
with a dispersion of values about it. This body of 


53 Paraphrased from Hull (1952a), page 11; (1951), page 95. An earli 
more general statement, is in Hull (1943), page 47. ure et, somewhat 
54 Hull (1942), page 77. 
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facts pointing to momentary variation is encompassed in the following 


postulate. 


Postulate 12. Behavioral oscillation (sOp) 


A. Reaction potential (sE,) varies from moment to moment, the standard 
deviation of the fluctuations serving as a measure of behavioral oscillation 


(sO,). The distribution in leptokurtic. 
B. Behavioral oscillation (sOn) begins with zero dispersion at absolute zero 


of sHp, then rises to an unsteady maximum as sHy increases. 
C. The oscillations of competing reaction potentials are assumed to be 


asynchronous.** 

The concept of behavioral oscillation, as finally embodied in this 
postulate, intrigued Hull as early as 1917,°° and it has had a place in each 
of his formal systems. It is instructive to note the changes that have oc- 
e these changes tell us something about the provisional 
g at this stage in the history of learning theories. 
the response threshold that oscillated. Hence a 
given reaction potential might suffice to bring a response above threshold 
on one trial, and then (without change in the reaction potential) the 
response might fall below the oscillating threshold on the next." 

2. In 1943 behavioral oscillation became an inhibitory appendage to 
reaction potential. Thus the amount of sOn present at the moment of 
responding was always subtracted from the sEr present at that moment. 
The distribution of sOn was assumed to be normal, that is, Gaussian." 

3. By 1952, the oscillation was incorporated into reaction potential, 
and no systematic reason existed any longer for the symbol sOn, it having 
become simply the standard deviation of sEr, and, incidentally, the unit 
according to which reaction potential was quantified. The detailed efforts 
to quantify reaction potential led to several of the refinements reflected 
in the postulate, notably (1) the statement that the distribution of sEr 
is leptokurtic rather than Gaussian, and (2) the discovery that the disper- 
sion of sOn is not constant, but changes with the number of reinforcements. 

The deductive use of the concept of behavioral oscillation was also 
extended in 1952 to account for alternation cycles in trial-and-error learn- 
ing. We shall return to this deduction later in this chapter. 
araphrased from Hull (1952a), pages 11-12. A typo- 


B of the postulate refer to subthreshold rein- 
cements are intended. See also Hull (1951), 


curred, becaus 
nature of system-makin 
l. In 1940 it was 


55 The whole postulate is pi 
graphical error in his book makes part 
forcements where superthreshold reinfo 
Fa corr ae EE ia history of the use of the concept, see Hull (1952a), 
pages 56-57. 

57 Hull and others (1940), page 74, 

58 Hull (1943), pages 314 an 
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Corollary xiii. Response generalization 


A. Response-intensity generalization. When a muscular contraction ; of 
given intensity has been reinforced, that muscle will contract on successive trials 
with varying intensities in a distribution about the central reinforced region. 

B. Qualitative-response generalization. When the contractions of several 
muscles produce a given habitual act, the combination of the independent varia- 


tion of each produces a qualitative change from the central result originally rein- 
forced.®® 


We have met qualitative change in response in Skinner's treatment 
of response differentiation and in his use of spontaneous changes to move 
by successive approximations from one response to another. Hull had 
something of this sort in mind in this corollary. 

Hull mentioned the corollary specifically three times in the 1952 book. 
He referred to it first by showing how, through reinforcing only the re- 
sponses of appropriate intensity, you can shift response intensity along 
a continuum. He pointed out, second, that locomotion is a highly general- 
ized form of response. Walking to one point in space does not differ 
fundamentally from walking to any other point. Third, he referred to the 
principle in accounting for the shift from moving toward a goal to dragging 
the goal-object toward the subject, as in stick and hoe experiments." 


Reaction Potential and Evoked Response 


The postulates have moved gradually from stimulus-traces through 
habit strength, drive, and other intervening variables determining reaction 
potential, until now we are ready to consider the emergence of the ob- 
servable, recordable response. Reaction evocation requires, first of all, 
that reaction potential must be above the threshold of response. 


u P Absolute zero of reaction potential (Z) and the reaction threshold 
sLr 


A. The reaction threshold 
tion potential (Ep). 

B. Response evocation occurs only when the momentary reaction potential 
exceeds the reaction threshold.* 


(sLy) lies above the absolute zero (Z) of reac- 


The “absolute zero” referred to here is a “relative” absolute, for it is 
based empirically on the number of reinforcements which must be de- 
livered experimentally before there is indication of learning. In the illustra- 
tions used by Hull, the order of magnitude of reinforcements between 


59 Paraphrased from Hull (1952a), page 12; (1943), page 319. Th 
not appear in Hull (1951). ms ooro lar as 


80 These three mentions can be found in Hull (1952a), pages 200, 218. 
®t Paraphrased from Hull (1952), page 12; (1951), pase 102; (1943). a 


HULL’S SYSTEMATIC BEHAVIOR THEORY 145 


“absolute zero” and the reaction threshold is two reinforcements of bar- 
pressing. This says nothing of the transfer from prior learning of other 
manipulatory skills or eating habits, and so can scarcely be thought of as 
a true absolute zero of habit strength. In fact, careful experiments have 
shown that the bar-pressing tendency has appreciable strength prior to 
reinforcement. 


Corollary xiv. The competition of incompatible reaction potentials 


When reaction potentials to two or more incompatible responses occur in 
at the same instant, each in a suprathreshold magnitude, only that 


an organism 
reaction potential is greatest will be evoked.®* 


reaction whose momentary 
is convenient in the derivation of trial-and-error learning, 
where the choice is, say, between turning right or left. The organism will 
then make the response that has the higher momentary reaction potential. 
now accounts for probability of occurrence as a measure 
or competing responses. Asa broad generalization the 
ans self-evident. Antagonistic muscles may both be 
contracted at once. Then, while either flexion or extension may occur, the 
muscles will be under heightened tension, and the response will differ from 
what it would have been had only one muscle set been active. The pos- 
sibility of compromise response—much emphasized by Guthrie—must 


The corollary 


This corollary 
of response strength f 
corollary is by no me 


also not be overlooked. 


Postulate 14, Reaction potential as a function of reaction latency 
(sEp) can be inferred from reaction latency (stp); the 


Reaction potential r ) 
ter the reaction potential.®* 


shorter the latency the grea 
e system, as a causal chain, is that latency is a function 
of reaction potential, and the postulate was so stated in early versions of 
the system. However, Hull’s desire to emphasize the anchorage of the 
system to observables Jed him in his later versions to turn the matter around, 

the strength of the reaction potential could be inferred 


and to indicate how 
from latency. The form of the function he expressed as follows: 


sEr = @ (str) 7? 
ntify sEn in some absolute manner, independent of 
the special units of measurement used in experiments, occupied a great 
deal of Hull’s attention during his last years. One of his close associates 
has stated that he believes it “almost certain” that Hull considered the 


The logic of th 


The effort to qua 


82 Sch feld, Antonitis, and Bersh (1950b). 2 
03 nel fom Hull (1952a), pages 12-13; (1951), page 104; (1943), page 


344. 
6i Paraphrased from Hull (1952a), page 13; (1951), page 107; (1943), page 344. 
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articles on the quantification of habit strength (and reaction potential) 
to be the most significant of his theoretical papers. 

The method used in quantification consisted of a paired-comparison 
technique based on one of Thurstone’s scaling procedures. The basic 
assumption is that latency directly reflects excitatory potential, and the 
comparisons used are between latencies on successive trials for groups of 
animals trained under comparable conditions, Beyond this basic assump- 
tion of decreasing latency with increasing reaction potential, the remain- 
ing assumptions are essentially statistical ones, having to do with the 
deriving of an equal-unit scale. The unit, as previously mentioned, is that 
of o, the standard deviation of sEx. Because this unit is very general (not 
in grams, millimeters, seconds, or any other measure specific to experi- 
ments), it serves Hull’s purposes very well. Having devised this scale, 
Hull was able to co-ordinate other aspects of response to it, and the final 
postulates were based on the assumption that other measures could best be 
stated in terms of the unit derived from the study of latency. 


Postulate 15. Reaction potential as a function of reaction amplitude 


Reaction Pe may be inferred from reaction amplitude. In the case of 
the Tarchanoff galvanic skin reaction in man, the relationship between reaction 


potential and amplitude is a linear one.** 

The reinforcements in a galvanic skin response experiment with human 
subjects were converted to the new units of sE derived from the latenc’ 
of bar-passing responses in rats. These scaled values of 
to bear a linear relationship to the amplitude of the conditioned galvanic 
skin responses as recorded on an uncalibrated apparatus in millimeters of 
deflection. This is the empirical basis for the postulate.” 

Hull felt that he was on safe ground in confining the amplitude 
postulate to autonomic conditioning (as represented specifically by the 
Tarchanoff phenomenon). Amplitude in striate muscle conditioning is 
complicated by the possibility of limiting amplitude through conditions 


of reinforcement ( Corollary xiii), a complication Hull felt to be inap- 
plicable to autonomic responses,°$ 


sEr turned out 


66 Hovland (1952), page 849. The major papers are: Hull, Felsinger, Glad. t 
Yamaguchi (1947), Felsinger, Gladstone, Yamaguchi, and Hull (1947), Cen 
ae Hull, and Felsinger (1947), Yamaguchi, Hull, Felsinger, and Gladstone 
(1948). 


66 Paraphrased from Hull (1952a), page 13; (1951), page 109; (1943 
67 It may be noted, however, that the same argument from the Pes 
possible in 1943, before the more elaborate scaling method was devised, Compare 


Hull (1948), Figure 78, page 340, with Hull (1951), Figure 25, pa igi 

data from Hovland (1937b). g Page Ue. Oneal 
e8 Hull (1943), page 339. He knew, of course, that the same licati 

possible with latency, but he disregarded it on the assumption that riper Oe 
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Postulate 16. Total responses to extinction as a function of reaction potential 


A. The reaction potential (sEg) acquired following massed reinforcements 
can be inferred from the number of massed unreinforced evocations (n) re- 
quired to produce experimental extinction. 

B. The reaction potential (sE,) acquired following quasi-distributed rein- 
forcements can be inferred from the number of massed unreinforced evocations 
(n) required to produce experimental extinction, but for any given value of n 
the inferred sEp will be higher than under the conditions of Part A, and the 
forms of relationship differ for Parts A and B.° 

Hull gave equations for both of these functions, but because he had 
a tendency, especially in the final system, to select any convenient illustra- 
tion for purposes of quantitative statement, it is not at all clear how general 
he expected these relationships to be. That of Part A might conceivably 
have some measure of generality; that for Part B is surely merely illustra- 
tive, for “quasi-distributed reinforcements” during training followed by 
“massed trials” during extinction yield a whole family of functions. The 
particular function presented as an equation in the statement of the 
postulate 16A expressed reaction potential as a function of trials to ex- 
tinction. If inverted to state the causal order of events, the equation implies 
after massed reinforcements a curve of increasing gains for number of 
responses to extinction with increments in reaction potential. For example, 
a reaction potential of 2.0 o will be followed by 20 non-reinforced responses 
in massed extinction, while one of 3.0 ¢ will be followed by nearly 50 non- 
reinforced responses. After distributed reinforcements (Postulate 16B), 
the corresponding curve is one of decreasing gains with increases in reac- 
tion potential. While for sEx of 4.0 v there will follow 20 non-reinforced 
responses to extinction, even at the near-maximum of 6 o there would be 
less than 25 non-reinforced responses to extinction. It is doubtful if the 
contrast between these two functions does more than point to the com- 
plexity of the relationships involved with varying schedules of reinforce- 
ment—a point amply demonstrated by Skinner. 

y 

{ Individual Differences 

- Hull offered the suggestion that the “constants” appearing in his 
equations may vary with the learner, and hence serve as a kind of index to 
individual differences. If, for example, rate of human learning were found 
in some context to be a function of measured intelligence, then the in- 


perimental precautions could be taken. For criticism of latency as a measure see Mueller 
(1950), Skinner (1950). (1952a), page 18; (1951), pages 112-114. The relation- 


69 Paraphrase! Hull à = N 
ship Tera r had earlier been assumed to be linear (1943), page 344, while 


in 1951-1952 the functions are both curvilinear. 
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dividual IQ (or some equivalent of it) might appear as a parameter in 
the equation relating the number of reinforcements to the achieved reac- 
tion potential. 


Postulate 17. Individual differences 


The “constant” numerical values appearing in the equations stated in the 
basic postulates and corollaries vary from species to species, from individual to 
individual, and from some physiological states to others in the same individual 


at different times.7° 

The treatment of individual differences was purely programmatic, for 
the postulates and corollaries as stated were based on responses from 
groups of subjects. Furthermore, the constants were usually obtained from 
curve-fitting, and hence do not have the rational status desirable in inter- 
changeable constants subject to independent measurement. ) 

The problem sensed by Hull is an important one, and could be faced 
provisionally with groups of subjects tested under several conditions. Con- 
sider, for example, the constants in the curve for serial-position effects, 
worked out in the rote learning system. The three constants are L (the 
reaction threshold), AK (the amount of inhibition generated by each 
repetition of each nonsense syllable), and F (a factor of reduction whereby 
inhibition is progressively reduced at syllables more remote from the 
point of origin of the inhibition). If the same learners participate through- 
out, and everything else is kept constant through appropriate experimental 
controls, then the constants should change in some systematic way for 


groups of learners of different ages. For any one group of learners the 


constants should not change when new conditions are introduced, such 


as length of list, which have no bearing on the constants. We have no evi- 
dence of this kind at the present time to support the adequacy of constants 
as measures of individual differences. The logic 


enough, and it is these possibilities to which Hu 
postulate. 


al possibilities are clear 
ll called attention in the 


The Antedating Goal Reaction and It 


s Role in Secondary 
Reinforcement 


Hull ended his final set of postulates and basic corollaries with the 
following one. 


Corollary xv. Secondary 
When 


stimulus-res 


reinforcement by fractional antedating goal reactions 


a response (R) is associated with a stimulus trace (s) and this 
ponse conjunction is accompanied by an antedating goal reaction 


70 Paraphrased from Hull (1952a), page 13; (1951), page 117. Th tter was first 
dealt with systematically in Hull (1945). Pig RARE 
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(rq) and by the stimulus resulting from it (sg), there will result an increase in 
the tendency for that stimulus (s) to evoke that response (R). 
The reinforcement comes about because of the secondary reinforcing 


powers of s,;.74 

In this corollary and in the discussion of it, Hull says nothing more 
about the role of sg in primary reinforcement (Postulate 3). This omission 
supports the earlier interpretation that the addition of sę to Postulate 3 
was an afterthought. The secondary reinforcing power of sg in Corollary 
xv is that of any stimulus that has been closely associated with primary 
reinforcement (see Corollary ii). 

We have now completed a summary of the postulates and corollaries 
presented in the first chapter of Hull’s last book as the basic “laws” of 
his system. In later chapters he added only two more principles given the 
status of corollaries, although he stated 133 theorems. The corollaries 
must have had special meaning for him, as somehow fundamental, and 


they therefore merit discussion here. 


lization between reaction potentials to positive and nega- 
mple trial-and-error learning 
A, When an organism acquires two reaction potentials gE, 4. and sEr— to 
similar stimuli $ , and S_ in the evoking of responses R, and R_, which, while 
different, use substantially the same muscles, a gain in gE, through reinforce- 
, us 


ment will be accompanied by a generalized gain in sEr— 
B. When the reaction potential to the negative stimulus (sEy_) loses 


strength through extinction, a generalized loss in sEr4 will accompany this 
loss in Ep? 

The relationships in this corollary follow rather directly from stimulus 
tion (Postulate 10 and Corollary xiii). 


Corollary xvi. Genera 
tive stimuli in si 


and response generaliza 


Corollary xvii. Experimental extinction 
z i i lar reinforcement, the evoking stimu- 

-reinforced trials following regu , the g 
lus a fora mne will continue to evoke the response, (2) sometimes will evoke 
the res: nse for a time with a rise in reaction potential, (3) the rise (if any) will 
Sen ý fall in reaction potential, and (4) the course of the 


be followed by a progressive 
itial i r iti i rve.T3 
fall in a potential is the reverse of a positive learning curve. 


Why a new corollary, when the same facts were covered in Postulates 
9A and 9D? Except for the statement about an occasional rise in reaction 
potential with non-reinforcement, there is nothing new in the corollary 


itself, The novelty rests instead in some of the thinking that lies behind 
the corollary. Hull renewing an earlier suggestion," described the realiza- 
7 Paraphrased from Hull (19520), pages 14 and 125. 

araphrased from Hull (1952a), page oA 
Paraphrased from Hull (1952a), page 134. 
74 Tull (1937). 
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tion of an anticipation as occurring when actual eating (Sg—>R,) fol- 
_ lows upon the antedating sequence (re—>se). When the anticipated Se 
is omitted, as in non-reinforcement, the anticipation is frustrated. Frustrated 
anticipation may lead to more intensified activity (heightened sEņ) through 
aroused emotion (hence heightened D). This accounts for that portion 
of the corollary suggesting temporary increase in reaction potential early 
in extinction. In the end, the accumulation of In will lead to diminished 
sEr. The decay process is enhanced, however, by the conditioning of in- 
hibition to sc, yielding scIn. Because this is learning, we have reason to 
expect the form of the extinction curve to reflect ordinary learning, though 

in reverse. 
In this corollary we have a good indication of unfinished business, 


as Hull, clear to the end, was attempting to rethink his systematic 
formulations, 


The System Summarized 


The set of postulates and corollaries became somewhat formidable, 
and in places somewhat fragmented as Hull attempted to work into the 
basic principles not only all manner of quantitative relationships found to 
hold in classical and instrumental conditioning, but other kinds of phenom- 
ena needed to deal with types of problems beyond these reference experi- 
ments. Even so, it is possible to cut through some of the specific detail, 
and to summarize the system in rather direct fashion as a chain of anchored 
constructs beginning with the antecedent conditions (input), moving 
through the intervening variables to response (output). Such a summary 
is given in Figure 15, 

In Column I we have input conditions, all except sH, defined by 
objective experimental conditions, (sHr is habit strength from a related 
habit, to become expressed as generalized habit strength, sHp. ) 

In Column 2 we find the intervening variables most closely tied to 
the antecedent conditions, In Column 3 we assemble, in an intermediate 
step, the consequences of the simultaneous presence of the variables in 
Column 2. We are now (in Column 4) close to response evocation, but 
we must first take into account the oscillation of reaction potential (sOr) 
and the threshold of response (Ly), Finally (in Column 5) the response 
emerges, with the measurable characteristics of latency (str), amplitude 
(A), or number of non-reinforced vocations to extinction (n).\Formerly, 
probability of response was also used as a response measure, but in the 
final system the probability measure was felt by Hull to be appropriate 


-r 
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OUTPUT 
VARIABLES 


FIGURE 15. Summary of Hull’s final system. The input and output variables, and 


the intervening varia 


Column (1) 

N, number of prior reinforcements 

Cp, drive condition 

S, stimulus intensity 

w, amount (weight) of reward 

s'Hn, strength of a habit based on same 
response conditioned to another stim- 
ulus : 

W, work required in responding 


Column (2) 

sHr, habit strength 

D, drive , 

V, stumulus-intensity dynamism 

‘, incentive motivation 

sHr, generalized habit st 
lated habit 


rength from re- 


only when conflictin 
inappropriate as a measure from 


bles, are symbolized as follows in the figure: 


In, reactive inhibition 
sI, conditioned inhibition 


Column (3) 

sEr, reaction potential 

sEn, generalized reaction potential 
sin, aggregate inhibitory potential 


Column (4) 

sEn, net reaction potential 

sOn, oscillation of reaction potential 
eLx, reaction threshold 


Column (5) 

stn, reaction latency 

A, reaction amplitude 

n, number of non-reinforced responses to 
extinction 


g responses were involved, and hence probability is 
which to infer sEp.” 


The diagram falls short in not listing the several important stimulus 


components making u 


78 The original probability postul 
of reasons for withdrawing it ma 
Permits use of the empirical data 


p the complex of traces present when a response is 


ate occurs in Hull (1943), page 344. The discussion 
be found in Hull (1951), page 126. Corollary xiv 
‘ormerly included in the probability postulate. 
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evoked, important among which are the drive stimulus (Sp) and the 
fractional antedating goal stimulus (sg). The kinds of items serving as 
components of the stimulus trace are listed in Table 2. Because these 
components are not all contemporary in origin, some of them representing 
the subsiding residues of just prior stimulation, others (on the basis of 
acquired antedating responses) representing events still to come, the 
complex pattern of stimuli makes possible the linking together of response 
patterns covering large segments of time and space. The stimulus com- 
plex contains traces that both activate behavior and guide it. 


TABLE 2. The Components of a Stimulus Trace 


Origin of Stimulus-Trace 
Component, with Symbol 


Resulting Component of 
the Stimulus Trace 


Environmental stimulus ..... S s Molar afferent impulse 

Compound stimuli ......... Sa S2... § Molar afferent impulse modified by 
afferent interaction 

ES Sas cist cited Salnaetas D Sp Drive stimulus 

RESPONSE) sae vince erence isas R Sr Proprioceptive consequence of re- 
sponse 

Fractional antedating goal 

TESDONSC! regiek 4 aiaiai To Sg Fractional antedating goal stimulus 


DERIVED INTERMEDIATE MECHANISMS 


Hull's system is characterized by him as molar, by which he me 
non-physiological. That is, the inferences to intervening vari 
depend upon the discovery of the physiological substratum wł 
in a “molecular” way. The assertions about molar stimulus traces, or af- 
ferent interaction, need not include specification in terms of the con- 
stitution of nerve bundles or the characteristics of the nervous impulse. 
Hull vacillated somewhat by including physiological language in his postu- 
lates. In any case, Hull’s system classifies as a reductive system, in that 
more complex phenomena are deduced on the basis of presumably simpler, 
more basic phenomena and relationships. In this sense the more complex 
are “reduced” to the simpler through analysis, It is characteristic of all 
systems of this kind that they intend to explain behavior that is super- 
ficially unlike the behavior from which the postulates are derived. In other 
words, Hull did not intend merely to systematize the account of rat lever- 
pressing, from which most of the data for his later set of postulates derived. 
He intended to arrive at the basic laws of behavior, at least the laws of 
the behavior of mammalian organisms, including the social behavior of 
man. 


In order to bridge the gap between the simple laboratory experiments 


ans 
ables do not 
hich is causal 
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furnishing the constants in the postulates and corollaries, and the more 
familiar behavior of organisms adapting to a complex environment, he 
derived intermediate mechanisms. Once these mechanisms were available, 
they opened the way to the explanation of many more varieties of behavior., 
We shall consider two of these mechanisms: the gradient of reinforcement 
(originally the goal gradient), and the habit-family hierarchy. 

We have to distinguish between the historical and the logical order 
of postulates and intermediate mechanisms. Many of Hull’s most brilliant 
deductive accounts of complex phenomena were made in early papers, 
prior to the postulate systems.” Many of these accounts were essentially 
unchanged when they were reworked for the 1952 book, even though 
in the meantime the postulate system underwent substantial revision. 
From a logical point of view, however, once the system has been con- 
structed, the postulates take priority. The same applies, of course to 
predictions. An experimental fact may be “predicted” by a system, even 
though the fact has long been known. The only advantage in predicting 
new (unknown) facts is that, in predicting them, there is less temptation 
to include the facts in the postulates used as a basis for their prediction. 


The Gradient of Reinforcement (the Goal Gradient) 


There are two chief kinds of time gradients involved in conditioning 
experiments. The first (found primarily in classical conditioning) is based 
on the time between the conditioned stimulus and the unconditioned 
stimulus, short forward intervals being found most satisfactory, Hull took 
this interval into account in developing his postulate regarding the stimulus 
trace (Postulate 2B). The assumption is that conditioning is actually simul- 
taneous, but the gradient is produced by a rising and falling molar trace 
set into motion by the external stimulus. The second kind of time gradient 
(found chiefly in instrumental conditioning) is based on the time between 
the response to be strengthened and the reinforcement. This second gradi- 


ent is considered in the corollary concerned with delay in reinforcement 


(Corollary iiiB). As pointed out earlier, the simple gradient (a single 
response reinforced ) is assumed to extend for about 5 seconds; if a rein- 


forcement is delayed more than 5 seconds, its effectiveness must depend 


upon other mechanisms. ; : 
The measurement of the gradient of reinforcement depends upon 
a 


experiments of the instrumental conditioning type, in which an act such 


76 For an introduction to the deductions available before 1940, see the summaries in 
Hilgard aval Marquis (1940); maze learning, 216-221; serial verbal learning, 221-296; 
reasoning experiments, 936-241; circumventing a barrier, 242-243. 
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as pushing a lever or running down an alley leads to the goal-object and 
reinforcement. Delay may be introduced between the act to be rewarded 
and the goal-object which provides the reinforcing state of affairs. Thus 
the delivery of the pellet of food which is the reward for lever-pressing 
may be postponed for an arbitrary number of seconds in order to study 
the effect upon rat learning of such a delay. The gradient of reinforcement 
is derived from such delayed reinforcement experiments, as distinguished 
from the stimulus-response separation experiments of classical condition- 
ing. Hull at first believed, on the basis of a delayed reinforcement experi- 
ment done with rats in the lever-pressing box, that the basic gradient of 
reinforcement was fairly short, in the case of the rat, “possibly no more 
than thirty seconds and very probably less than sixty seconds.” 

Other experiments on the effect of delayed reward, such as those of 
Wolfe (1934), show that in a simple alley maze the reward is effective 
for learning with much longer delays, at least as long as 20 minutes, the 
longest delay tested. A gradient is shown here, also, w 
tervals less effective for learning than the shorter ones. 

Let us see how these gradients are related, before we turn to the 
manner in which they are used to derive other forms of behavior, The 
final theory somewhat unclearly implied four steps: 

1. The influence of reinforcement is solely upon the immediately 
concurrent stimulus and response units, There is no genuine primary gradi- 
ent of reinforcement. In this Hull follows Spence (1947), and he has 
eliminated delay in reinforcement from his postulate set. 


2. For a single stimulus-response conjunction followed by delayed 
reinforcement there is a short 


5 seconds. This is expressed in 
basis of rg and its Sc, 

8. For links in a 
links farther remove: 


ith the longer in- 


seconds. It is described qual- 
itatively in Corollary iiiA, but is not derived, While later it is treated 


quantitatively, the quantities are assigned on the assumption that the 
gradient exists in appropriate magnitudes. Nowhere is it linked quanti- 
tatively with the gradient of Corollary iiiB just described, 

4. In any actual chain of responses as observed 


and recorded, the 
gradients of reinforcement affecting individual links in 


the chain summate 
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in complex fashion to produce empirical gradients ™ bearing little re- 
semblance to any single inferred underlying gradient of reinforcement. 

To be strictly elegant in system construction, we ought to be able to 
move by formal steps from each one of the foregoing numbered proposi- 
tions to the next. We can follow Hull's attempt to get from (1) to (2), 
though his derivation involved outmoded features of his system.’* He 
moved from (2). to (3) very informally, without attempting to construct 
a sound bridge between them. He made a more serious, and more success- 
ful, effort to move from (3) to (4). To that step we now turn. 

This is our first illustration, in any detail, of how Hull proceeded 
to use an intermediate mechanism in accounting for concrete behavior in 
a complex situation, Beginning with the assumption of the long gradient 
of reinforcement of point (3) above (an assumption already involving 
many steps from the basic postulates), the problem is to account for the 
actual latencies of response at each step of a four-response chain, leading 
to reinforcement at the end. 

Let us begin with the empirical data that we wish to derive. They 
come from an experiment by Arnold.?® A rat learns to press a button 
presented outside the window of its restraining cage, whenever the shutter 
is opened. Pressing the button is rewarded by a pellet of food. Now the 
experiment proper begins. A clever device presents to the rat a chain 
of four stimulus-response possibilities by means of a car running by his 
window on a track. In this experiment, there are four parts of the car, 
all alike, all with buttons exactly like the one he already learned to press, 
When the shutter is opened, button B, is presented. He presses it, and 
starts the car in motion, though no pellet appears. Three seconds later the 
second button B. has come into view as the car stops with it before the 
window. He presses it; now button B, appears. In three more seconds he 
has the opportunity of pressing B,. Because the TES pousenwas well learned, 
there is not enough extinction in these few non-reinforced trials to pre- 
vent the response. Having made the four responses in order, the shutter 
falls, and the food reward appears, giving terminal reinforcement to the 


whole linked series. The series of events is presented once a day, and a 


measure of reaction latency taken at each button. The results for trials 


is a convenient term for the gradients actually 
a Pay, 7 tween the gradient of reinforcement 

fo the distinction was made bet e gi 

oha. At one a strengthening, and the goal gradient as the effect upon per- 

es a distinction first suggested by Miller and Miles (1935). But when the 

gradient "of reinforcement came itself to be derived rather than basic, the former 

distinction between it and the goal gradient was no longer maintained by Hull, 


7 e ages 127-128. 
ri ane at ea ditional experiments in the series were also considered by Hull. 


77 The term “empirical gradient” 
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2-10 are shown in Figure 16. While there is a kind of gradient of rein- 
forcement, with latency getting shorter nearer to the goal, there is a 
striking upturn at B,. The theoretical problem is to derive this empirical 


gradient from the overlap of gradients, all of which fall off from the point 
of reinforcement. 


IN SECONDS 


MEAN REACTION LATENCY 


oi 


i m w 
ORDER OF MANIPULANDA PRESENTATION 


FIGURE 16. Reaction latency of responses to each of the four stimuli in a be- 
havior chain with terminal reinforcement. Adapted from Arnold (1947), page 356, 
and Hull (1952a), page 164. 


The steps involved are as follows: 
1. On the basis of prior learning, there is the original 


sEr available 
equally to B,, Ba, B,, and B,. Some initial value must be assumed. (Hull 


arbitrarily assumed 2.0 ø.) 

2. The general form of the gradient of reinforcement with time is 
assumed, and both its form and the number of trials required to generate 
it are arbitrary, though the form is consistent with gradients of gener- 
alization. The resulting increments of reaction potential at each of the 
stimuli turn out to be: B,, .234; B., .380; B,, .617; B,, 1.000 o. This conforms 
to our expectation of greater gains nearer to reinforcement, bec 
forcement follows B,. 

3. The original sEr (2 o) and the increment through reinforcement 
must be added for each link in the chain, according to the principle by 


ause rein- 
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which reaction potentials are combined. The sum will, of course, be less 
than the arithmetical sum. The result turns out to be: B,, 2.16; Ba, 2.25; 
B,, 2.41; B,, 2.67. These values Hull calls the gradient-of-reinforcement 
values in this context, because they are the values at each link before we 
consider the interaction of the different links. 

4. We must now take into account the fact that a stimulus does not 
cease to operate when it is withdrawn, but its trace persists. Hence that 
persisting stimulus trace will derive reinforcement from each of the sub- 
sequent secondary reinforcements along the route to the goal, and through 
generalization the effect will be felt at the next onset of the stimulus. 
Similarly, each response along the way contributes strength to the links 
preceding it (whose traces are represented when it is being reinforced) 
and to the responses following it (because its stimulus trace will be part 
of the stimulus complex when these succeeding responses occur). This 
two-way contribution of each link to the others is computed, again through 
arbitrary formulas whose exponents have no relationship quantitatively to 
those given in the postulates, though the form of the equations is’ as 
postulated. The results are given in Table 3. 

TABLE 3. Steps Used by Hull in Computing the Theoretical Mean Reaction Laten- 
cies at the Response Points of a Four-Link Response Chain. The gradient of rein- 


forcement values, from which the remaining values in the table are generalized, are 
owr in bold type. See text. From Hull (1952a), page 162. 


Components of Reaction Potential 
(sEr) 


Assumed d values 3 1 


(in j.n.d.’s) 
Response number B: Bı B: Bı 


Values based on delay 


in reinforcement 
9 seconds delay ae ae 2> o 
6 seconds delay SD 121 2.41 121 
3 seconds delay = 95 1.89 2.67 
0 seconds delay se : á i 

a 1 D 3.540 3.780 3.467 
Behavior sums (+) of Er 2.827 ” ” r 
Reaction latencies (etn) 3.51 2.19 1.91 2.80 


5. Now we add up the various components of reaction potential at 


each link in the chain. The addition is again by formula, and not straight 


arithmetic. The sums are given in the row second from the bottom of 


Table 3. 
6. The final step i 
again an arbitrary formula, 


s to convert these sEx’s to latencies (strs). Here 
with new constants but familiar form, is used, 
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and the latencies computed, as shown in the bottom row of Table 3. This 
is the end of the determination, and the results are plotted in Figure 17, 
for comparison with Figure 16. 

The general agreement between the two figures is striking enough to 
indicate Hull's ingenuity. There is some disagreement in the absolute 
magnitudes, however, and this led Hull to assert: 

This degree of agreement between experiment and theory is, perhaps, as 
close as may reasonably be expected in the present early stage of the science.‘° 

The agreement lies in the form of the function. The failure lies in 
the latency measure—a matter of really great importance to the system 
because of the reliance on latency for quantifying sEr. Hull had earlier, 


4 


nN w 


REACTION TIME IN SECONDS 


o 
[i 2 3 4 
ORDER OF REACTIONS IN CHAIN 


FIGURE 17. Theoretical reaction latency of res onses to each of imuli 
in a behavior chain with terminal reinforcement, anpare with the e pA ein 
shown in Figure 16. Adapted from Hull (1952a), page 163. Š 


in choosing the equation for the computation of latency, multiplied the 
values derived from Postulate 14 by an arbitrary constant (8+) because 
he noted that the values would otherwise have been too small,*-then two 
pages later he finds that he had not boosted the latency enough. He could 
easily have chosen a constant to make the values correspond much more 


60 Hull (1952a), page 165. 
81 Hull (1952a), page 163. 
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closely, by multiplying by 50 instead of multiplying by 3. Possibly he 
worked the matter out independently, and then felt it would be deceptive 
to work backwards from the result, or thought a 50-fold correction was 
too great. Because the constants are arbitrary anyhow, he might as well 
have worked backwards, and achieved as good a fit as possible. Had the 
constants been independently determined, there would have been some 
point in seeing how much discrepancy remained. As it stands, the pre- 
diction of the form of the function is an achievement. Everything else 
is arbitrary, and consequently the achievement itself has much that is 
ad hoc in it. 

The main point of this detail with respect to the derivation of the 
gradient for a chain of responses has been to point out how the derivation 
depends primarily upon the intermediate mechanism (the gradient of 
reinforcement), but then uses other principles (e.g., generalization, sum- 
mation of reaction potentials, conversion of reaction potential to latency ) 
to achieve a predicted result, an empirical curve that bears little resem- 
blance to the principles used in its deduction, but corresponds closely in 
form to the results of a complex learning experiment. The deduction 
points out also how deceptive is the apparent quantitative rigor of the 
system, The quantities in the postulates are mere illustrations of 
quantities, and are not, in fact, made use of in any consistent manner in 
later derivations. 

While the gradient of reinforcement is not the primary one it was 
earlier considered to be, it is a derivative or intermediate principle, which, 
once established, again serves its former purposes in the explanation of 
more complex learning. The original application was to the maze. The 
principle mediated the deduction that responses nearer to the goal would 
be more strongly conditioned than those farther removed, so that short 
paths would be preferred to longer ones, blinds near the goal would be 
adily than blinds farther away, longer blinds would 
ated than shorter ones, and so on.** The goal 
plied to field-force problems as studied by 


eliminated more re 
be more readily elimin 
gradient principle was later ap j ; : : 
Lewin.® For example, in experiments involving the circumventing of 
barriers between the learner and a visible goal, Hull proposed that the 
reaction to the perceived goal-object should behave in accordance with 
the goal gradient. That is, the nearer the learner came to the goal, the 


troductory exposition, see Hilgard and Marquis (1940), 


P ul (1092). Ronan m history of the problem of the goal gradient, see Hull 


pages 216-221. For the earli 
eal ees the blems were such as proposed by Lewin (1933a) (1935). 


83 Hull (1938). Th o A 
The EE ap ep aed with little change in Hull (1952a), pages 262-268. 
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stronger should be its response-evoking power. Thus Hull, by way of 
the gradient of reinforcement, came to conclusions similar to those de- 
scribed by Lewin as goal-attraction in relation to distance. 


The Habit-Family Hierarchy 


A second derived principle is that of the habit-family hierarchy. 
This is not included among the postulates because, like the gradient 
of reinforcement, it is a principle at intermediate level, being itself de- 
rived from more basic principles. It carries great weight, however, in 
the deduction of further behavioral phenomena. 

Because there are multiple routes between a starting point and a 
goal, the organism learns alternative ways of moving from a common start- 
ing point to a common goal-position where it finds need satisfaction. 
These alternatives constitute a habit family because of an inferred inte- 
grating mechanism. The integration into a family is by way of the frac- 
tional antedating goal reaction, present as each alternative is active, 
The fractional anticipatory goal reaction provides a stimulus (sq) to 
which all overt responses are conditioned. Through the differential action 
of the gradient of reinforcement, some responses are less strongly con- 
ditioned to sę than others. The starting responses of longer routes, for 
example, are more remote from reinforcement than the starting responses 
of shorter routes. Hence the latter are more strongly reinforced, and 
more strongly conditioned to sg. As a consequence, the alternative be- 
havior patterns are arranged in a preferred order. The less favored routes 
are chosen only when the more favored are blocked. It is this set of al- 
ternative habits, integrated by a common goal-stimulus, and arranged in 
preferential order, that constitutes a habit-family hierarchy. 

It is further deduced by Hull that if one member of a habit-family 
hierarchy is reinforced in a new situation, all other members of the 
family share at once in the tendency to be evoked as reactions in that 
situation.** This makes possible the explanation of response equivalences 
and other appropriate reactions in novel or problematic situations, such 
as those found in insight and reasoning experiments. 

The principle was first applied to maze learning,** serving chiefly to 
explain the tendency for the rat to enter goal-pointing blinds, even though 
such entrances had never been reinforced in the maze situation. Goal 
orientation was taken to represent an inappropriate transfer of spatial 
habits acquired in free space. Another application was in relation to the 


84 Formal deduction was first given by Hull (1937), page 27. 
85 Hull (1934a). 


HULL'S SYSTEMATIC BEHAVIOR THEORY 161 


detour experiments.** The difficulty of turning away from a perceived 
goal beyond a barrier depends on the presence of habit-family hier- 
archies as well as upon goal gradients. In the usual experience of free 
space, the favored path is the straight line between the learner and the 
goal. The next-favored starting response is that making least angle with 
the goal. The greater the angle, the less favored is the starting response 
in that family of habits built up in previous experience. Hence, when 
blocked, the learner prefers a path which goes off at a right angle to one 
which requires that he turn his back on the goal. In some objective situa- 
tions he may come to choose a longer path to a shorter one, if the habit- 


family hierarchy proves to be misleading. 


SOME REPRESENTATIVE DEDUCTIONS AND THEIR 
EXPERIMENTAL TESTS 


We have met one illustration of Hull’s deductive procedure in his 
computation of reaction latencies at each link of a four-link reaction 
chain with terminal reinforcement. His system commanded its initial at- 
tention because of his skill in carrying out corresponding computations 
in a variety of fields of experimentation. Summaries of many of his 
earlier deductions are already available.“ A number of these deductions 
were repeated in the final book, with moderate changes to conform to 
the new system. The three illustrations chosen for discussion in what fol- 
lows are, in substance, new to the final book (response alternation in 
trial-and-error learning, latent learning, and adient-abient conflict ). Al- 
though they represent but a small sample of the total available deduc- 
tions in the final system, they will serve to demonstrates Hull’s method of 
arriving at new phenomena with the help of familiar postulates. 


Response Alternation in T rial-and-Error Learning 

as impressed by the gradualness of trial-and-error learn- 
mplied that a response, once correct, was not con- 
after, and that an incorrect response, once elimi- 
p occasionally. Hull attempted to come to 
d with this problem of response alterna- 


Thorndike w: 
ing. The gradualness i 
sistently repeated there 
nated, must have cropped u 
closer grips than Thorndike di 
tion in trial and error. 


It will make the problem more concrete to begin first with the em- 
86 Hull (1938). on serial learning and problem-solving in Hilgard 


87 In a iti summaries 5° 
and Marquis (1840 ee rote memorization and related phenomena, see the first edition 


of this book, Hilgard (1948). 
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pirical results, and then to turn to the deduction. In one of his own 
experiments Hull (1939b) used a modified Skinner box, as sketched in 
Figure 18. The experimenter could present either the vertical or the 
horizontal bar alone, or both together. The rat first learned to operate 
the vertical bar alone, and continued to operate it for 15 food-reinforced 
responses. The following day the experimenter presented only the 


FIGURE 18. Diagrammatic representation of box used to study trial-and-error in 
the two-lever problem. Either the horizontal bar (H) or the vertical one (V) could 
be made positive; that is, when the appropriate bar was pressed a pellet of food fell 


through the tube (T), into the food-dish (R). After Hu (1939b), page 237. 
horizontal bar, and the rat learned to operate it for 60 food-reinforced 
responses. The crucial test came the next day. The experimenter first 
allowed a warm-up by permitting four reinforcements of the already 
strong horizontal bar habit. Then he presented both bars at once, but set 
the apparatus so that only response to the vertical bar led to reinforce- 
ment. This became a simple trial-and-error experiment, requiring a re- 
versal from the stronger and more recently reinforced to the weaker and 
earlier reinforced habit. 

All 76 animals that finally mastered the trial-and-error learning met 
the criterion of learning gradually, responding now to the horizontal bar, 
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now to the vertical bar. Sometimes the rat would respond to one bar 
for several trials, then respond to the other for a run of trials. Hull 
grouped these vacillating runs into what he called alternation cycles. 
He defined a cycle as including two phases, a run of responses to the 
vertical and a run of responses to the horizontal, such runs being pre- 
ceded and followed by changes of response. Thus the cycle lasts from a 
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r alternation cycle in simple trial-and-error learning 
originally more strongly conditioned than the cor- 
as plotted in Hull (1952a), page 52. 
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FIGURE 19. Responses pè 
when the incorrect response was 
tect one. Data from Hull (1939b), 


shift in a given direction (say from horizontal to vertical) through the 
shift in the opposite direction (from vertical to horizontal) and con- 
tinues until the original shift occurs again. This corresponds to the usual 
definition of a two-phase cycle. The length of the cycle is determined 
by the number of responses in the runs making up the two phases of 


the cycle. 
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In order to exaggerate the effect somewhat, Hull selected the data 
from the 25 animals (among the 76) which yielded from four to six com- 
plete response-alternation cycles. His plot of the number of responses 
per alternation cycle for the first four cycles is reproduced as F igure 19, 
Can such a curve be predicted on the basis of the postulates of the 
system? Here is how Hull went about it. 
"He first made use of an earlier derivation of the course of a proba- 
bility-of-reaction curve for learning under these conditions. His earlier 
computations had resulted in a table of reaction-probabilities of response 
to the positive and to the negative stimulus on successive occasions. First 
the reaction potentials to the positive and negative stimuli were com- 
puted in ø units. The probability of a response occurring to one or the 
other of the stimuli depends upon the amount of Separation of these two 
reaction potentials, because they are in conflict. Bec: 
based upon ø units, he could use the familiar formulas for significance of a 
difference and determine a critical ratio, d/o. The probability of reaction 
can be determined by the probabilities associated with this ratio, The 
result of this earlier computation was a sigmoid curve of 
the positive response.** 

While Hull seems here to be making simply a familiar statistical 
computation, the psychological (and systematic) assumptions underlying 
it are consonant with the assumptions used in scaling sEn, because the 
o unit used in the computation is the inferred behavior 
(sOr). 

From the values entering into the probability-of-reaction learning 
curve, he moves on to response alternation without additional psycho- 
logical or systematic assumptions, The next steps are purely statistical, 
and depend solely upon principles of probability. The principles are 
used with considerable ingenuity to derive the curve that is plotted as 
Figure 20. What this curve says is that response cycles will at first be 
relatively long (consisting mostly of the negative responses, not yet ex- 
tinguished), that they will fall to some minimum length, and then in- 
crease in length again (at the end consisting largely of positive responses, 
now overcoming through reinforcement their original handicap relative 
to negative responses). 

Hull has now, by Way of a number of steps consistent with his 
theory, supplemented by probability considerations, derived a theoretical 
curve that shows a strong family resemblance to the empirical curve of 
Figure 19. The abscissa units are not alike, and precise comparison is not 


ause the potentials are 


acquisition of 


al oscillation 


88 Hull (1952a), pages 30-33, 
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possible; but the resemblance is strong, and the deduction is in this 
case an unusually rigorous one. 

The plotting of response-alternation cycles exhibits another of Hull’s 
strengths—an ingenuity he showed in discovering or inventing new ways 
to summarize the data from familiar types of experiment. The theoretical 
basis for the curve shown in Figure 20 permits other deductions under 
different conditions of experiment, so that this figure is but one of a 
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D ical responses per alternation cycle when the incorrect re- 
IRURE 20; TOE atro a conditioned than the correct one. Compare with 


a iginally mor ni 
kiea GINS oF Figure 19, After Hull (1952a), page 48. 


family of curves. Mathematical formulation has the advantage that shifts 
in fhe form of functions can be predicted when the parameters change. 


These shifts are often very different from those that might be guessed or 


intuited. If they are then confirmed experimentally, the theory receives 


a genuine support. 


Latent Learning 
Sometimes, when a new incentive is introduced, animals perform- 
, 


ing at a very mediocre level without the incentive make sudden gains, 
: 

i i g para A 
bringing them abreast of animals trained all along with the favored in- 
centive. Because the new incentive seems to bring out concealed or 
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unused learning, experiments of this kind, when successful, are said to 
reveal latent learning. Such experiments were formerly a source of con- 
troversy between reinforcement theorists and cognitive theorists. The 
cognitive theorists cited these experiments in support of their position 
and pointed out the difficulties for the reinforcement theorist. In the type 
of theory Hull held in 1943 they were indeed difficult to account for, 
because sHp then depended upon the magnitude of the reinforcement. 
With the change in emphasis from <Hp to sEr, Hull no longer had diffi- 
culty with latent-learning experiments and proceeded to deduce their 
results as consequences of his new theory. 

It is doubtful that Hull deliberately shifted his system in order to 
account for latent learning, for the facts of latent learning were at the 
time very much in doubt, and his followers were still very critical of the 
results of earlier latent-learning experiments. Another possibility is that 
the shift from sHy to sEp occurred because of Hull’s preoccupation with 
quantification. He found that he could quantify sEr, but was far from 
quantifying sHy. Hence it was preferable to write more equations in 
terms of sEp.®” 

The method by which Hull derived latent learning is simplicity 
itself. So long as there exists a modicum of reinforcement, sHr builds 
up as a function of the number of trials. The revealed Ep will be small, 
and the revealed learning slight, because one of the multipliers (K, the 
incentive reinforcement) is small. Now when K is made large, through 
the introduction of the major incentive, we have a new multiplier, and 
the amount of sEr (for the same sHr) will be much greater. This is 
largely what is meant by latent learning: a sudden gain in sEp with the 
introduction of incentive, 

Hull’s deduced latent learning is shown schematically in Figure 
21. Shifts of incentive, either upwards or downwards, within a few trials 


place the organism on the learning curve characteristic of animals trained 
all along with the new incentive, Hull acce 


pts the familiar latent-learning 
data of Tolman and Honzik (1980) 


as supporting his prediction, 
Adient-Abient Conflict 


The postulates and corollaries include references to 
responses and to a factor (K) having to do with the 
centives used in prior reinforcements, They 
perceived incentives. Hence Hull saved t 


anticipatory goal 
magnitude of in- 
assert nothing, however, about 
he problems of perceived in- 


89 This is also Koch’s interpretation, Koch (1954), page 106, footnote 29, 
90 We shall meet latent learning again in the next chapter. 
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centives to be treated through theorems derived from the more basic 
postulates. 

Perceived incentives are very important in ordinary behavior. Con- 
sider a cat lying in wait to pounce upon a bird, or a buzzard circling 
in search of carrion, or a male dog attracted by a female in heat. The 
ongoing behavior is very much influenced by the perceived incentive. 
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centives are of two kinds: those, such as food, which in- 
vite approach; and those, such as noxious stimuli, which invite avoidance. 
Hull, selecting some older terms, called approach behavior adience or 
adient behavior, and withdrawal abience or abient behavior. 

Hull takes the major steps from the postulates to the basic theorems 
of adient and abient behavior informally, that is, swathone quantitative 
reference to the postulates (except for similarities in the forms of func- 
tion chosen). In the first step. he accepts it as plausible that approach 
responses to an attractive incentive near at hand have been conditioned 


Perceived in 
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according to familiar principles. The same is implied, but in reverse, for 
withdrawal responses. At a distance, an incentive object, whether adient 
or abient, will produce a stimulus pattern resembling the one close to the 
object, so that responses conditioned to the object itself will be condi- 
tioned to the object at a distance, at an intensity corresponding to the 
difference between the stimulus patterns for the near and far object. 
Thus the familiar generalization gradient applies. It is further assumed, 
on the basis of these considerations, and because locomotion is itself a 
highly generalized kind of behavior, that both adient and abient behavior 
will be highly generalized in respect to both direction and distance. 
Through these informal steps, largely by analogy with matters dealt with 
in the postulates and corollaries, Hull succeeded in providing for the 
kinds of conflicts in free space that Lewin treated under his concepts of 
“valence,” an adient goal being (for Lewin) positively valenced, and an 
abient goal being negatively valenced. 
Now we come to the quantitative portion of the deduction. For this 
purpose we take a special problem of adient-abient conflict, best illus- 
trated in the experiments of J. S. Brown (1942) (1948) and the related 
experiments as reported by N. E. Miller (1944). Brown determined two 
points on a gradient of adient behavior, and two points on a gradient of 
abient behavior, as illustrated in Figure 22. A hungry rat ran down an 
alley to receive food at the end, the food location being distinctively 
marked with a light. The animals wore a little harness attached to a cord. 
The harness did not restrain them during their learning, but in testing 
they could be restrained by the cord, and the strength of pull measured, 
This is the pull in grams as recorded in Figure 22 at the two points of the 
approach gradient. Other animals, instead of receiving food at the end 
of the alley, received electric shock. When restrained near the point of 
shock, they pulled hard to get away from the shock zone, but if restrained 
at a distance they pulled away only slightly, These two points make up 
the avoidance gradient, which is seen to be much steeper than the ap- 
proach gradient. Miller and his associates predicted that if animals were 
both fed and shocked at the end of the alley, they would be placed in 
conflict. When placed at the end of the alley farthest from the ambivalent 
goal, they should approach part way and then stop, the point of stopping 
being related to the point at which the two gradients cross, because here 
the opposite pulls are equal. By strengthening the hunger or weakening 


the shock, the animals could be made to approach nearer to the goal 


91 The important evidence for the theorem under study is from an unpublished study 
by Miller, Brown, and Lipofsky (1943), discussed by N. E. Miller (1944 ), page 487. 
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before stopping. Hence the experimental results agreed with the inter- 
pretation that the two independent gradients measured by Brown would 
interact when both food and shock occurred together. 
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FIGURE 22. Empirical form of approach and avoidance gradients. The straight- 
line gradients are based on two points determined by J. S. Brown for each gradient 
Separately, as reproduced by Miller (1944), page 434. The theoretical point of equi- 
librium has been added. 


Now let us see how Hull, with these facts before him, ties them into 
his system. This is a good illustration of prediction after-the-fact, but 
such prediction is, of course, entirely permissible in a deductive system. 

The theorem with which Hull comes out is stated as follows: °? 


Theorem 86. Other things equal, with moderately sophisticated subjects, 
when an adient object and an abient object occupy nearly the same point in space 
and the maximum abient reaction potential is greater than the maximum adient 
potential there will be a point of stable equilibrium at a j.n.d. distance from 
the adient-abient object amounting to 

Er 
log, $ 
B10 sEy 
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d = + $ 
r= 
Miller’s conclusion is thus given a precise mathematical form. The 


derivation here will serve to provide a step-by-step illustration of the 


method Hull used much more generally. 
First, for the straight-line gradients of Figure 22, Hull substituted 


92 Hull (1952a), pages 249-250. 
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the curved gradients of Figure 23, in order to use curves of the same 
form that he used in generalization gradients (Postulate 10). The adience 
exponent he made half that of the abience exponent, simply because 
Brown and Miller had found the two gradients to differ in slope. 

This tampering with the exponents, in order to agree with the results 
of experiments, is allowable, but it should follow some of the rules of 
the system. In strict logic, because Hull does not deduce these differences 
in exponents, he must postulate them. Therefore he has added a new 
but unacknowledged postulate to his system at this point (until such 
time as the differences are derived from more fundamental principles). 
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FIGURE 23. Theoretical adient and abient gradients used by Hull in predicting 
the point of equilibrium in an 


adient-abient conflict. Compare with the empirical 
results of Figure 22. Replotted from Hull (1952a) to conform with the conventions 
of Figure 22. 


The point of equilibrium in Figure 23, like that in F igure 22, is the 
point at which the two curves cross. What Theorem 86 states is merely 
the algebraic solution of the problem solved graphically in Figure 23, 

For adient behavior, we assume the gradient to be of the form: 

sEn = sE X 10-% 


Where sẸp is the generalized reaction potential 
a distance d in j.n.d.’s from the point of reinforce 


(1) 
at any point on the gradient at 
ement. 


The corresponding gradient for abience is: 


sE'r = sE’g X 10—74 (2) 
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To derive the equation required for the theorem we find that value of 
d at which the two gradients will balance each other, where sEx = sE’p. 
This is a matter of simple algebra, which looks more forbidding than it 
is partly because the subscripts S and R are appendages without any 
mathematical meaning. 

We know that, at the value of d which we seek 


sEn = sE’p (3) 
Hence, from (1) and (2) 
sEr X 10-44 = sE’g X 10-74 (4) 
Eliminating the negative exponents, we may write 
sEr _ sE'r 
10 ~ 104 (5) 
Transposing terms, we have 
107? _ sE’p 
10 Er (6) 
Taking the logarithm of both sides, 
E’, 
vd — jd = log,, =—* 
ja —j Bio Ep (7) 
Solving for d, the value we are after, 
E 
logio Ee (8) 
d= — = 
“=i 


is the one appearing in the theorem. It results solely 
orm of the gradient the form earlier adopted for 
th the added assumption (similar to Miller’s) 
point where the two gradients coincide. 
To prepare a plot as in Figure 98, it is necessary to make computa- 
tions, using actual values for the terms entering into equation (8). Let 
us see how Hull did this. He assumed the following values, which, while 
arbitrary, are of an order of magnitude used elsewhere in the system: 


Equation (8) 
from selecting as the f 
generalization gradients, wi 
that there is an equilibrium 


sE’'r = 40 The greater magnitude of the abient reaction potential sE’y is 
sEr = 30 assumed in Theorem 86. 

if .02) The greater steepness of the abient gradient (j’ > j) is assumed to 
i (01 J be a fundamental characteristic. 
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Substituting these values in Equation (8), we have 


I o 
8 Be _ log 1.333 
d= 01 or 


Looking up the yalue of log 1.333 in a table of logarithms, we find 
log 1.333 = .1248, hence 


-1248 
01 


By examining Figure 23, we see that the algebraic and graphical 
solutions agree (as indeed they should! ). 

What do we learn from these computations? When we follow Hull’s 
computational technique, we find that very often the results are obtained, 
as in this example, through assigning equations to simultaneous functions, 
and then solving the simultaneous equations. The tie-in with the total 
system is often very slight. In this case, the forms of the gradients are 
the same as those used in generalization gradients, and the constants 
are similar in meaning. But the accuracy of the deductions does not de- 
pend upon these similarities, and, in fact, some of the correspondence 
between the system and the deductions is purely gratuitous. For ex- 
ample, no operational meaning is given to a difference in jn.d’s along 
a gradient of this kind, so that there is no check whatever on the use of 
these units. An accurate use of empirically defined j.n.d. units might very 
well change the slope of the gradient. Furthermore, the puzzling prob- 
lems of the relative effectiveness of reward and punishment, which ought 
to be dealt with as central features of a reinforcement theory, are here 
glossed over by the very molar assumption that the exponent for the 
gradient of abience is double that for adience. For a genuinely precise 
deduction it would be necessary also to show that strength of pull when 
restrained has a determinate relationship to reaction potential as quanti- 
fied in the system. Despite the appearance of precision, and of tie-in 
with his system, Hull’s deduction is by no means rigorous and represents 
little, if any, advance over the simpler presentation of F: igure 22, 

Proceeding in somewhat similar manner, now that the general prin- 
ciples of adience and abience were accepted and in some sense justi- 
fied, Hull was able to combine the principle of the habit-family hierarchy 
with that of the gradient of reinforcement to explain the attraction of 
goals beyond barriers, and of the paths taken around such barriers, These 
are the kinds of problems in open space emphasized in field the 
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ories such 
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as Lewin’s. Hull was willing to go so far as to borrow portions of the 
physical analogies used by the field theorists: 


Assuming as a first approximation that behavioral vectors in quite naive 
subjects operate roughly as physical vectors, even a slight mgvement to one side 
of the line will unbalance the otherwise completely oppised reaction tend- 
enciés. ... .38 

Because adience and abience operate in two-dimensional space, Hull 
acknowledged that he had developed a field theory of sorts: 

The theory of adient and abient behavior thus involves examples of bona 
fide field theory, though this theory must not be confused with physical field 
theories, from which the present theory differs in most respects.” 

Hull referred to two chief differences between his kind of field theory 
and physical field theory: first, the organism is not propelled, as a 
physical particle is, by external forces from the field, but the energy 
for its movements comes primarily from the food it eats; second, the 
distances important in physics are actual distances (centimeters or 
meters ) while the distances in his system are in j.n.d.’s, hence defined ac- 
cording to the organism’s reactions. The residual difference between Hull 
and Lewin on these points appears to be almost purely linguistic, for 
Lewin would not have objected to the substance of either of Hull's two 


points. 


Success in Prediction 

Eventually a self-correcting hypothetico-deductive system must be 
judged by its successes, that is, the agreements between predictions and 
observed events, and by its adjustments to its failures. 

On any kind of tabulation, Hull’s several postulate systems have 
fared very well. R i 

A tabulation of the material in the rote-learning book showed evi- 
f the 110 theorems and corollaries. The evidence 


dence bearing on 71 0 
ases (55 per cent), was somewhat ambiguous 


gave clear support in 39 c ) what ar 
in 20 cases ( i per cent), and disagreed with the prediction in the re- 
maining 12 cases (17 per cent).°° l 

The successes represent a substantial achievement, but the failures, 
too, are achievements, in that they show Hull’s statements to have been 
made clearly enough that data could contradict them. 


93 Hull (1952a), page 242. 
9 Hull (1952a), page 269. 
3 Hilgard (1940). 


ay 


174 THEORIES OF LEARNING 

Hull presented his own summary of successes in prediction in the 
final book.” He classified the evidence bearing on 121 of 178 theoretical 
propositions in that book. Of these, he found 106 (87 per cent) substan- 
tially validated, 14 (12 per cent) probably valid, but uncertain, and 
only one (1 peste) definitely invalid. 

This very high agreement is somewhat disturbing at this stage in 
science, even though the ultimate aim is exceptionless validity. Too high 
agreement, when the system is as unfinished as this one, suggests either 
(1) low standards with respect to what constitutes agreement, or (2) too 
great reliance on working backward from data, with ad hoc assumptions 
to guarantee agreement. Guaranteed agreements should not be tabulated 
as verifications. Working backwards is not, in itself, a defect in a theo- 
retical system. In fact, there is no real alternative, for the improvement 
in a system comes precisely through “working backwards” to obtain 
better agreement with reality. But there are rules by which the deduc- 
tions remain coherent and internally consistent, and these are often 
difficult to apply. As Hull pointed out, the 55 predictions in his book not 
covered by known relevant evidence will provide a good test of the fer- 


tility of his system, because they are free of the charge of working 
backward from the known. 


ESTIMATE OF HULL’S POSITION 


Hulls Position on Typical Problems 


Because he was willing to face the problems of learning posed by 
others as well as those which he set himself, it is possible to assign Hull 
a position on the representative problems chosen as the basis for compar- 
ing the different points of view. 

1. Capacity. A volume on individual differences on which Hull had 
been working was not completed by the time of his death. He published 
but a single paper on the problem 7 and did not carry the analysis much 
further in his final book. Individual differences in capacity will be re- 
flected in the constants that appear in the behavioral laws. Once these are 
under control, it will be possible to make fundamental attacks on such 
problems as that of the relationship between learning and persistent 
individual differences. 

2. Practice. Mere contiguous repetition does nothing but generate 
inhibition; all improvement depends upon reinforcement. Hull is in this 


96 Hull (1952a), 351-353. 
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respect in agreement with Thorndike and Skinner, and opposed to Guth- 
rie, Because the amount of reinforcement does not affect habit strength, 
provided some minimum amount is present, the number of reinforce- 
ments is the basic variable in acquiring habit strength. 

3. Motivation. The basic paradigm for reinforcement lies in need 
reduction, as when food relieves the body's need for sustenance, or when 
escape saves the organism from injury. Thus the primary reinforcing 
quality of reward and of punishment is the same: food reward relieves 
hunger tension, escape from shock reduces shock tension. While these 
are the underlying biological facts, the reinforcement itself is mediated by 
stimulus reduction, that is, by reducing the stimuli associated with the 
drive, rather than by satisfying the drive itself. Secondary reinforcement 
can be provided by any stimulus regularly associated with primary rein- 
forcement. 

yDrive is complexly related to learning: (1) it provides the basis 
for both primary and secondary reinforcement, (2) it activates habit 
Strength into reaction potential, and (8) it provides differential internal 
stimuli that guide behavior. ( 

More complicated relationships involving anxiety, avoidance, ex- 
pectation, frustration, conflict, may be derived from the more primitive 
principles of primary and secondary reinforcement. Important inter- 


mediate derivations include adient (approach) behavior to perceived 


positive incentives and abient (withdrawal) behavior to perceived nega- 


tive incentives. : 
4. Understanding. The organism's own responses furnish the surro- 


gates for ideas. The fractional antedating goal response (rg) provides 
whose sole function, in some instances, is to guide behavior. 
ch stimuli are called “pure stimulus acts.” Ideas 
lity that Guthrie also assigns to them. 
Two intermediate mechanisms emerge as important in meeting the prob- 
lems of behavior in space, and in problem-solving generally. These are 
the principle of the gradient of reinforcement, extended to include objects 
perceived at a distance, and the habit-family hierarchy, which permits the 
maximum utilization of past experience in the solving of present problems. 
Both principles depend for their effectiveness upon aroused fractional 
anticipatory responses, and upon discriminations among the stimuli that 


these responses produce. 

5. Transfer. There are tw 
and equivalence of responses. 
on the basis of generalization, 


stimuli (sg) 
Responses that provide su 
thus have the substantive qua 


aspects to transfer: equivalence of stimuli 
Hull explains equivalence of stimuli either 
or via intermediate reactions. Equivalence 


eat 
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of responses depends in part upon response oscillation, and the generaliza- 
tion there involved; it depends also on the organization of responses into 
hierarchies by way of the habit-family hierarchy. All responses in the 
hierarchy have in the past led to the same goal, so that they are in that 
respect equivalent., 

J. Forgetting: In the volume on rote learning, the decay of excita- 
tion is postulated by Hull and his associates to occur according to a kind 
of law of disuse, making forgetting a function of time. He reaffirmed 
this position in one of the corollaries of the 1943 book, but did not pro- 
vide for it in his postulate set.” I find no reference to forgetting as such 
in the 1952 book, the only references being to the rise in the curve of 
reminiscence, without discussion of the subsequent fall. The only ac- 
quired function that decays systematically with the passage of time is 
reactive inhibition (Ir). 

It is pertinent to refer here to Hull's own summary of the basic 
principles as he perceived them, after completing the system reported 
in his final book. He listed and described eight automatic adaptive be- 
havior mechanisms. 

1. Inborn responses tendencies (sUr) provide the first automatic 
mechanism for adapting to emergency situations. 

2. The primitive capacity to learn is the second mechanism, “a 
slightly slower means of adaptation to less acute situations.” 

3. The antedating defense reaction, arising through learning com- 
bined with stimulus generalization, provides the third adaptive mech- 
anism. 

4. The extinction of useless acts, negative response learning, is the 
fourth mechanism. 

5. Trial-and-error learning is the fifth mechanism. 

6. Discrimination learning is the sixth mechanism. 

7. A second type of antedating defense reaction, depending upon 
the persistence of stimulus traces (rather than upon generalization, as in 
the case of a perceived dangerous object), is the seventh mechanism. 

8. The fractional antedating reaction (rę) with its proprioceptive 
stimulus correlate (sq), provides for the “automatic (stimulus) guidance 
of organismic behavior to goals.” This eighth and final mechanism is the 
crowning achievement of the system. 


Further study of this major automatic device presumably will lead to the 
detailed behavioral understanding of thought and reasoning, which constitute 
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the highest attainment of organic evolution. Indeed the rg —> sę mechanism 
leads in a strictly logical manner into what was formerly regarded as the very 
heart of the psychic: interest, planning, foresight, foreknowledge, expectancy, 
purpose, and so on.*® 


The Status of Hull’s Final Reinforcement Theory 


Hulls theory is not a highly integrated one, so that no one concept 
is truly central to it. If one accepts his own summary, according to the 
foregoing eight mechanisms, the fact of learning is important, but the 


particular mechanism of learning is not crucial. Perhaps the most crucial 
concept is that of the re-se mechanism, which has only a tangential rela- 
tionship to reinforcement theory. Nevertheless, the most controversial 
issue among the S-R theories concerns the status of reinforcement. 

Hull's vacillation between the items to be reduced in primary rein- 
forcement led him to reject D, to turn to Sp, then to add sg as an after- 
thought. There was no finality to his interpretation of primary reinforce- 
ment. We shall return to this issue again in Chapter 12, when we con- 
sider the views of some of his followers, and some recent experiments. 

He was also quite unclear as to what lay beneath secondary rein- 
forcement, although he was clear as to its importance. A stimulus as- 
sociated with primary reinforcement became at once (1) a conditioner of 
secondary drive, hence a producer of Sp, and (2) a secondary reinforcer, 
and hence the equivalent of reduced Sp. Hull was not really clear about 
this, because he seems not to have thought about both secondary drive 
reinforcement at the same time. His thinking was par- 
ticular, not general. When he thought of secondary drive, he thought of 
what Miller and Mowrer talk about as fear or anxiety. When he thought 
nforcement, he thought of such things as distinctive end- 
t when he wrote his formal 


and secondary 


of secondary rei 
boxes, or tokens substituting for food. But wh } l 
propositions, they implied that a stimulus associated with reinforcement 


could become at once both a drive and a reinforcing agent, for the con- 
ditions necessary to become a secondary drive appear to be the same as 
the conditions for becoming a secondary reinforcing agent (Corollaries 
i and ii). The literary form of the two corollaries differs somewhat, and it 
may be that Hull intended a difference, but if he did he was not clear. 
Hence these fundamental matters—the nature of primary and sec- 


ondary reinforcement—he really left for others to settle. 


29 Hull (1952a), page 350. 
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The Quantification of Reaction Potential 


Just what did Hull achieve in the effort to quantify reaction potential, 
a task to which he devoted much effort, and in the success of which he 
took some satisfaction? 

It is worth taking a look at what he did, because the problem of 
assigning quantities to intervening variables is one that other theorists 
who use such variables (e.g., Tolman) must eventually face. 


OUTPUT. 
VARIABLES INTERVENING VARIABLES VARIABLES 


FIGURE 24. The logical problem of measuring intervening variables. 

A simplified version of Hull’s system is presented in Figure 24. It 
omits generalization effects, K, V, and inhibition, but it includes the ‘most 
important intervening variables „Hp, D, and sEn, circled in the diagram 
to distinguish them from the input and output variables. The problem is 
to get from the external world of input and output to these intervening 
variables, in terms of which the most important equations are to be writ- 
ten. If we assume everything else constant, our fundamental equation 
is: 

sEr = fa (sHr, D) (1) 

As a first approximation Hull assumed the equation to be of the 

form: 


sEr = sHp X D (2) 
But while we are about the problem of strict quantification, we can- 
not make this assumption. It is part of our empirical-theoretical problem 
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to determine the form of the f, function. We cannot move from equation 
(1) to equation (2) without proof. 

Now, in general, we have three helpful contributions to determining 
the f functions, because each of the intervening variables is anchored to 
input or output. We have the three functions: 


sHr = f, (N), where N is the number of reinforcements (3) 
D = f, (h), where h is hours of tood deprivation (4) 
sEp = f, (stn), where stp is the latency of response (5) 


Were we able to determine f,, fẹ and f, we should have some 
chance of determining f, We can find equations in Hull’s system for 
each of these functions, so we might assume that he had followed out 
this program. Actually he did nothing of the kind. 

The only function indicated in Figure 24 that is actually given in the 
data is f,, which can be stated: 

str = fa (N) (6) 

That is, we know from experiments that in some instances latency 
of response becomes progressively shorter as the number of reinforce- 
ments increases, and we can fit curves to our results, or scale our results, 
if we wish. Hull and his collaborators proceeded to scale this function, 
and then named the scaled value sEx. That is, the f, function is deter- 
mined on the assumption that reaction potential bears a one-to-one cor- 
respondence to scaled latency. Then 

sEy = fy (str) (7) 

Now we are able to make an algebraic manipulation between equa- 
tions (6) and (7), substituting sEr for str in equation (6), and we ap- 
pear to haye measured an intervening variable: 

sEr = fy(N) (8) 

This is really the extent to which Hull, strictly speaking, quantified 
intervening variables. The f, function was never determined from inde- 
pendent computations of sHx, D, and sEr by way of the appropriate f,, 
fa and f, functions. . 

A further step was taken to complete the picture by using the data 
from Perin (1942), in which a functional relationship was established 
between hours of food deprivation and latency of response. Again by 
using the f, function shown in Figure 24, it was possible to state a re- 
lationship between hours of food deprivation (h) and reaction potential 
(sEn). But this depended solely on the relation between (h) and (str) 
and still did not solve the problem of the f,, fa» and f, functions, which is 
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really the heart of the problem of assigning quantities to intervening vari- 
ables. Hull was rightly aware of the task that needed to be done, but he 
failed to accomplish it. 

The quantification task that was carried out so carefully—the scaling 
of latency—was not a total loss, for in the process some contributions 
were made to scaling theory; and the new position given behavioral 
oscillation in the system led to some interesting implications, as we saw 
in the example of predicted alternation sequences. 


How Satisfactory a System Did Hull Leave? 


Gulls system has many points of superiority over any other existing 
psychological system) It is at once comprehensive and detailed, theoreti- 
cal, yet empirically quantitative. It is easy to locate faults within it be- 
cause it is so carefully worked out, so explicit and mathematical that 
its errors of incompleteness or inconsistency are easily brought into focus. 
A theory expressed solely in the ordinary literary language may sound 
very plausible because the gaps in the theory are glossed over through 
cogent illustrations. A theory such as Hull’s calls attention to itself when- 
ever it jumps a gap. In criticizing Hull's theory we must not lose sight of 
the fact that, with all its weaknesses, it is a major achievement. 

We do well to think of Hull's system as really twofold. On the one 
hand{he was embarked upon a bold and comprehensive theory of be- 
havior,/a theory he hoped would serve as a basis for much of social sci- 
ence. On the other hand, he was experimenting with a very precise minia- 
ture system, with determinate constants based upon controlled experimen- 
tation. He attempted to combine these two enterprises at once, and was 
not very skilled at distinguishing between what he accomplished on a 
large scale and on a small scale, for he wanted the whole to be one sys- 
tem. 

On a large scale, when dealing with behavior in free space, problem- 
solving, and ideas, Hull made skillful use of peripheral mechanisms, 
particularly the hypothesized r,—s¢ sequence. By interlocking these 
goal anticipations into other features of his system, particularly through 
the gradient of reinforcement and the habit-family hierarchy, he was able 
to make large-scale deductions of familiar forms of behavior. When mov- 
ing on this large-scale level, the theory was very “molar” indeed, and al- 
most no efforts were made to pin down precisely the kinds of anticipatory 
movements (chewing movements, bodily postures) that would serve as 
the tangible base for the important fractional antedating response. The 
choice of this mechanism was made (as a surrogate for ideas) because it 
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was ponderable, and hence, in principle, at least, subject to measurement. 
One can easily think of possible experiments to test aspects of the ra 
mechanism, especially the grosser aspects.’ If experiments cannot be 
designed to test the subtler rs mechanisms, then one might as well deal 
with “ideas” as with “pure-stimulus acts.” The argument that the charac- 
teristics of these acts can be inferred, if not measured, has some appeal, 
but it is possible to infer discriminations between ideas or engrams, too. 
Perhaps rg need have no more tangible reality than sHx, but the system 
always implies that it is a genuine response, with proprioceptive conse- 
quences, It may be noted also that the large-scale deductions, built on 
fo, do not require any one special theory of learning, so long as goal 
anticipation can be achieved by that theory. 

When Hull was operating on a smaller scale, and attempting to be- 
come precise and quantitative, he became highly particularistic, confining 
many of the postulates and corollaries to the results of single experiments 
done with rat bar-pressing in a Skinner box modified so that a latency 
measure could be secured. Hull became so preoccupied with this quanti- 
fication that he failed to distinguish between this exercise in miniature- 
system construction and the larger task on which he was simultaneously 
engaged. He therefore began to write into his basic postulates what were 
really exercises in quantification. Surely a set of laws of mammalian 
behavior ought not reflect in the constants of its basic postulates such 
specialized information as the most favorable interval for human eyelid 
conditioning (Postulate 2), how many days without food weakens a rat 
(Postulate 5), the weight in grams of food needed to condition a rat 
(Postulate 7), a rat’s reaction time (Postulate 13), or the amplitude in 
millimeters of galvanometer deflection in human conditioning (Postulate 
15). 

It would have been preferable to distinguish between the tentative 
generalization and then separately to offer the quantitative evidence as 
illustrative. An alternative would have been to limit the miniature system 
to the rat in the bar-pressing box, with every effort to be systematic and 
to make the various constants interchangeable, to make a determined 
effort to measure intervening variables within this one limited universe. 


Then the particularistic postulates would have been appropriate, as other 


Postulates were appropriate to rote learning. 
The source of these deficiencies in the system cannot easily be speci- 
fied. It is difficult to distinguish between weaknesses inherent in the 


System, and problems that were Hull’s as a theorist and system-maker, 
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We have two illustrations in his career of systems beginning with a 
large sweep and then, after heroic quantitative effort, ending so par- 
ticularistic as to be much less inviting than they were before the effort 
was made. The first was the rote-learning theory, which, in its grosser 
form +% appeared more inviting than in its better worked-out form.?° 
So many ad hoc assumptions had to be made in the more precise formu- 
lation that little has been heard from this since. Now we find a similar 
progression in the theory of adaptive behavior. The early papers, in the 
1930's, gave a new outlook upon the systematic prediction of behavior 
in a great variety of situations. The postulates were based largely upon 
the interpretation of maze behavior (simple trial and error, and serial 
learning ). Now, as the system became more precise, these larger predic- 
tions were mostly carried over from the past; the new postulates in both 
1943 and 1952 became so tied to a limited kind of experiment (chiefly 
rat bar-pressing) that they are little likely to survive. It may be that 
particularism is an inevitable consequence of seeking precise quantita- 
tive prediction in the present stage of behavioral science. If so, it means 
that the theorist, at least for a time, will have to be modest in what he 
attempts to do, and may have to be satisfied to remain within a minia- 
ture system, If particularism is not the inevitable result of the scientific 
logic used, then it may be that something was wrong with the kind of 
concepts employed. The additional possibility is that some limitation 
in Hull as a systematist led him to particularism. Whatever the reason, 
Hull’s followers will undoubtedly be able to build on his foundation to 
produce a more satisfactory postulate set than he left. 

It must be acknowledged that, Hull’s system, for its time, was the 
best there was—not necessarily the one nearest to psychological reality, 
not necessarily the one whose generalizations were the most likely to 
endure—but the one ĝyorked out in the greatest detail, with the most con- 
scientious effort to be quantitative throughout and at all points closely 
in touch with empirical tests. Furthermore, it may well be said to have 
been the most influential of the theories between 1930 and 1950, judging 
from the experimental and theoretical studies engendered by it, whether 
in its defense, its amendment, or its refutationMts primary contribu- 
tion may turn out to lie not in its substance at all, but rather in the ideal 
it set for a genuinely systematic and quantitative psychological system 
far different from the “schools” which so long plagued psychology. ` 
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Chapter 6 


TOLMAN’S SIGN LEARNING 


: 
lige theory of Edward C. Tolman was called purposive behaviorism 
in his major systematic work, Purposive behavior in animals and 
-men (1932). Since then he (and others) have called it a sign-gestalt 
theory, or an expectancy theory. These later 
terms all emphasize the cognitive nature of the theory, which distin- 
guishes it in certain respects from the stimulus-response theories of Thorn- 
dike, Guthrie, Skinner, and Hull. The designation sign learning provides 
me, \abbreviating sign-gestalt and sign-significate, 
while calling, attention to the cognitive reference within the theory. 
' | Tolman acknowledged the complex affiliations of his system—with 


Watson’s behaviorism, McDougall’s hormic psychology, Woodworth’s 
stalt psychology (in both classical and Lewin’s 


ed a number of parallels between his system and 
the probabilistic functionalism developed independently by Brunswik.* 
Still later, the influence of psychoanalysis began to be noticeable.* Be- 
cause of these complex affiliations it is appropriate to consider his sys- 
tem after our familiarity with the other behaviorisms, and before turn- 
| ing to the non-behavioristic theories. 

: i Despite some shifts in vocabulary, Tolman has held firm to his main 
tenets during the more than twenty years since his book appeared: 

1. His system is a genuine behaviorism, and as such rigidly rejects 
introspection as a method and “saw feels” as data for psychological sci- 
ence. \W hen he makes reference to consciousness, to inventive ideation, 
“and the like, he is talking about interpretations of observed behavior. 
He does not accept “verbal 3 report” as a dodge by which to smuggle 


fo consciousness in through the back door. 
| yo 2 The system is a molar, rather than a molecular, behaviorism. An 
4 J 


j 2 Tolman and Brunswik (1935). 
f 2 Tolman (1942) (1943). 
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act of behavior has distinctive properties all its own, to be identified and 
described irrespective of whatever muscular, glandular, or neural proc- 
esses underlie it\The molecular facts of physics and physiology upon 
which behavior rests have identifying properties of their own, which are 
not the properties of behavior as molar. This means for Tolman an in- 
dependence from physiology, a characteristic which he shares with 
several of the writers whom we have considered. 

( 8. The system is a purposivism, but of a sort to avoid the implica- 
tions of a teleological metaphysics. It is a purposivism because it recog- 
nizes that behavior is regulated in accordance with objectively determi- 
nable ends} It is not mentalistic; purposes are not those of a self-conscious 
mind. It isnot in agreement with teleological points of view which make 
effects take precedence over and determine their causes. 

The strongest rejection is of American structuralism, because struc- 
turalism was dependent upon introspection of the most offensive sort, 
that known as Beschreibung. Watsonian behaviorism is almost as vigor- 
ously rejected, because it was not only molecular but tended to neglect 
the problems of goal-seeking behavior. 


THE SYSTEMATIC POSITION 
Behavior as Molar 


The descriptive properties of molar behavior are the most general 
characteristics of behavior which would impress themselves upon an in- 
telligent onlooker without prepossessions and before any attempt to ex- 
plain how the behavior comes about. 

First, behavior is goal-directed. It is always a getting-toward some- 
thing, or a getting-away from something. The most significant descrip- 
tion of any behavior is what the organism is doing, what it is up to, where 
it is going. The cat is trying to get out of the box, the carpenter is build- 
ing a house (or earning a living), the musician is seeking acclaim for his 
virtuosity. The particular movements involved are less descriptive of 
the molar behavior than is the goal toward which or away from whick 
the movements lead. This feature characterizes molar behavior as pur- 
posive. 

Second, the behavior makes use of environmental su orts as 
means-objects toward the goal. The world in which behavior tres on i 
a world of paths and tools, obstacles, and by-paths with which the or- 
ganism has commerce. The manner in which the organism makes use of 
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paths and tools in relation to its goals characterizes molar behavior as 
cognitive as well as purposive. TE Ie 

Third, there is a selective preference for short or easy means-activities 
as against long or difficult ones, called the principle of least effort. 

Fojurth, behavior, if it is molar, is docile. That is, molar behavior is 
characterized by teachableness. If it is mechanical and stereotyped, like 
a spinal reflex, it belongs at the molecular level.{Docility is said to be a 
mark òf purpose.’ 

l 
Intervening Variables rd 

The complete act of behavior is initiated by environmental stimuli 
and physiological states. Certain processes intervene, and behavior 
emerges. | Programmatically, this is the formula which Hull took over 
fom Tolnan. The problem of psychological analysis at the molar level is 
to infer the processes which intervene between the initiation of action in 
the world of physics and physiology and the resulting observable conse- 
quences, again in the world of physics and physiology. Because all of 
the data are rooted in this world, the system remains a behaviorism. 

In spite of his methodological behaviorism, Tolman is clearly bent 
on making a “psychological” as against a “physiological” analysis. (The 
intervening variables include such processes as cognitions and purposes, 
so that, on one side of its ancestry, Tolman’s position belongs with the 
gestalt psychologists who have been characterized as “centralists,” rather 
than with the stimulus-response psychologists characterized as “periph- 
eralists.” 4 

The precise variables entering into behavior determination have not 
remained fixed in Tolman’s later discussions, but his logic of system- 
making has remained the same. The set of terms used in his presidential 
address before the American Psychological Association in 1937° may 
serve as illustrative. 

The background of physiology and physics with which choice-point 
behavior begins is defined by environmental and individual difference 


variables: 


I. Environmental Variables 


M—Maintenance schedule 
G—Appropriateness of goal object _ 
S—Types and modes of stimuli provided 


3 In this Tolman follows Perry (1918). 
4 Murray (1938); Leeper (1944). 
5 Tolman (19382). 
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R—Types of motor response required 
x (OBO)—Cumulative nature and number of trials s 
P— Pattern of preceding and succeeding re87° mate 


‘hysiolog 

Il. Individual Difference Variables ee 
~ H—Heredity ‘Tol i 
Tolman 

- A—Age 1 
T—Previous training e share 
E—Special endocrine, drug or vitamin cond | 
the in 


use it PS be 
ely geonship 
elf-conttning: 


It is possible to study the effect of such yes on resulti! 
havior. The usual learning curve is a plot of tiptional aie 
under stated conditions. These are the behaviorits" about Jel 


n intervening Ples. These a re in- 
ferred processes ; & 
= oe i between the independent varid (stimuli, ete, E and 
p t variables (responses, ete.). The \ Minary list asih į pre- 
sented co-ordinates one intervenin . bl Ach of th Sive | | 
J y j on: 
mental variables, B TOABE We © Si as yon 
i ; ne 
Intervening Variable Enwircantal Variable 
pee correlated with Maintenance schule 
Diterentiation “ 2 Appropriateness goal object 
Motor skill “ «Types and modest stimuli provide | id 
Hypotheses a «Types of motor ræonse required 
Be i x «Cumulative natur@od number of t \ials 
iase Pattern of precedng and Susceeding n | 


maze units 
Although the intervening variables may sound subjective, in pring hei 
ple each can be given objective definition and measurement throws h a 
defining experiment in which everything else is held constant ae dd the 
correlative environmental variable while that one is sYstemáti 


; iatically varied. 
Demand, for example, may be expected to increase faith: the number of 
hours since feeding, but the relationship between fo. q deprivation and 
demand is not a simple one. It must be studied Empirically, The same 
holds for each of the intervening variables. 

Having thus established a basis for inferring the value of the inter- 
vening variable from the antecedent conditions, the next stage in theory 
construction is to find the equations relating intervening variables to 
behavioral outcomes, as these intervening variables are simultaneously 
varied. 

® (OBO) is a shorthand formula which means some consequence or summation of 
previous experiences in which one occasion (O) has led through behavior (B) to an- 


other occasion (O). The occasions are such features as a choice point, a goal at the 
left, and so on. 
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In spite of the clear outline of what a systematic theory ought to be, 
Tolman has nowhere attempted quantitative predictions paralleling those 
of Hull, so that his conjectures have not in that sense been put to the 
test.? This does not mean that his experiments are unrelated to his theory. 
There are, in fact, many predictions, but they assert that one path will 
be preferred to another, that under one set of circumstances the problem 
will be easier than under another set, and so on. The dimensional analy- 


sis which completes the function is not provided, and Hull’s conscientious 
efforts are instructive in showing how difficult that task is likely to be. , a 


Recent Formulations in Relation to Earlier Ones 
There have been no radical revisions in Tolman’s systematic think- 
ing, although various schemes have been tried out for classifying the 


phenomena of learning, and a number of diagrams have been prepared 


to provide models for thinking about learning and other aspects of be- 


havior, The confident insistence upon building all principles so as to be 
derivable from and applicable to rat behavior (possibly to guarantee that 


the system remained a behaviorism) seems to have been somewhat re- 


laxed. Some of the later papers attempt to generalize the interpretation 
of motivation and cognition in a way useful to social science generally. 
In these later papers more illustrations come from child-rearing and hu- 
man behavior than in the earlier writings. 

The most elaborate model since the first book holds to the original 
pattern of independent variables, intervening variables, and dependent 
s The changes from earlier to later models do not re- 


(action) variables.* 
flect any fundamental reorientation, although there are some vocabulary 


differences. (he three chief intervening-variables in the 1951 model are 
(a) the need system, (b) the belief-value matrix, and (c) the behavior 
space, - 
\ a need system bears a family resemblance to the drive theories 
i ; a : 
with which we are already familiar. (The needs arise either through 


physiological deprivation or through psychologically defined drive con- 


ditions, ) 
The behavior space i 
objects in various places, wit 
actor at a given momen 
“valences” on them, and th 
behavior space is very similar to 
ego Tolman (1939) (1941). 


n which “locomotion” takes place is a space of 


h distance and direction, as perceived by the 
t) These objects have positive or negative 
e actor is attracted or repelled by them. The 
Lewin’s life space. We shall consider 


T Some starts have been made, 
$ Tolman (1951b) (1952b). 
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this kind of construct when Lewin’s views are discussed in Chapter 8. 
If we have become accustomed to stimulus-response language, we 

are likely to be confused initially by some of the language the cognitive 
theorist uses, especially his physical or geometrical metaphors. When 
the stimulus-response psychologist refers to an object in space, he means 
that physical object, or at least those aspects of it which, as stimuli, can 
impinge upon the subject’s receptors. When the cognitive theorist refers 
to an object in space, he is commonly talking about that object_as per- 
ceived, or as it is expected to be when perceived. It is important to re- 
member that perception and perception-like processes are central for him. 
Thus when Tolman assigns valence to an object he does not mean that 
some mysterious attracting or repelling force has been added to a physi- 
cal object. He means only that the object as perceived or as expected is 
as an object-to-be-sought or as an object-to-be-avoided. We may recall 

William James’ description of a nestful of eggs as perceived by a broody 
hen: “the utterly fascinating and precious and never-to-be-too-much-sat- 

upon object.” For the hen, the nestful of eggs has positive valence, 
UY The heliefvalue matrix holds most interest as a possible supple- 
mentation to learning theory. This matrix (or, really a set of matrices) 


includes learned categorizations and differentiations concerning environ- 


mental objects. Objects categorized in a common class have gradients of 
generalization built up, so that the objects come to have a preferential 
or hierarchical order, similar in effect to Hull’s gradients of generaliza- 
tion or his habit-family hierarchy. Tolman points out how little infor- 
mation we have about any of these gradients, except for simple sensory 
qualities. What he has in mind is something more complex. He uses as 
an illustration the functional dimension described by Sears and his co- 
workers for fantasy aggression in the doll-play of children.” These in- 
vestigators found that doll-figures showed a consistent order of “psycho- 
logical distance” from the child, as revealed by the amount of a 
either attributed to them, or addressed to them. The order—f; 
to far—is the child itself, then the parent doll of the same sex, the parent 
doll of the opposite sex, the child doll of the same sex, the child doll of 
the opposite sex, then the baby doll, then puppets or imaginary forces. 
Tolman assumes that such a gradient would be learned in our culture 
and would constitute then one item in the belief-value matrix, 

To carry further with this system, Tolman points out that it would 
be necessary to provide operational definitions for all the following terms: 

® E.g., Sears (1951). 


ggression 
rom near 
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VARIABLES TO BE OPERATIONALLY DEFINED 1° 


Need Systems 
a. List of needs 
b. Magnitude of a given need at a given moment 


Belief-Value Matrices 
c. The magnitude of the gratification and deprivation “values” of the 


given matrix at a given time. 
d. The shape of the cathexis belief attaching the various types of goal 
object (arrayed along a given generalized dimension) to the gratifi- 


cation end of a given matrix.” 
e. The shape of the means-end belief attaching types of means object 


(arrayed along a given generalization dimension) to a given type of 
goal object or subgoal object. 
Behavior Spaces . 
f. Perceived qualities, distances, and directions in the behavior space. 


The strength of a need-push. 

The strength of a positive or negative valence. 
The strength and direction of a field force. 
The identification of a locomotion. 


It is evident enough from the unfinished business implied in this 
list that, whatever new directions are proposed for the model, the new 
model is not yet ready for detailed critical appraisal. 


=- TS 


Ysien LEARNING! 


( Sign Learning as an Alternative to Response Learning 

l Stimulus-response theories, while stated with different degrees of 
sophistication, imply that the organism is goaded along a path by inter- 
nal and external stimuli, learning the correct movement sequences so that 
they are released under appropriate conditions of drive and environ- 
mental stimulation. The alternative possibility is thatthe learner is fol- 
is learning his way about, is following a sort of 
map—in other words, is learning not movements but meanings. This is 
the contention of Tolman’s theory of sign learning} The organism learns 
sign-significate relations; it learns a behavior route, not a movement pat- 
tern. Many learning situations do not permit a clear distinction between 


lowing signs to a goal, 


10 iti verbatim from Tolman (1951b), page 334. 

1 ene G o E or selective appentis whereby one goal object is 
valued above another as a means of obtaining gratification. The geometrical concept 
(“shape of cathexis belief”) refers to some sort of scaling of a series of possible goal- 
objects (e.g., foods) on a preference dimension, some of them being closer together 


some farther apart. (Tolman, 1951b, pages 337-338. ) 
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these two possibilities. If there is a single path with food at the end and 
the organism runs faster at each opportunity, there is no way of telling 
whether his responses are being stamped in by reinforcement or whether 
he is guided by his immanent purposes and cognitions. 

Because both stimulus-response and sign learning so often predict 
the same behavioral outcome, it is necessary to design special experiments 
in which it is possible to favor one theory over the other #firee situa- 
tions give strong support to the sign-learning alternative. These are ex- 

eriments on reward expectancy, on place learning, and on latent learn- 
he À TRS -, Bii 

| / 1. Reward expectancy. One of the earliest and most striking observa- 
tions on reward expectancy was that of Tinklepaugh (1928). In his ex- 
periment, food was placed under one of two containers while the monkey 
was looking but prevented from immediate access to the cans and food. 
Later the monkey was permitted to choose between the containers and 
showed skill in choosing correctly. (The behavior which is pertinent here 
occurred when, after a banana had been hidden under one of the cups, 
the experimenter substituted for it a lettuce leaf (a less preferred food). 
The monkey rejected the lettuce leaf and engaged in definite searching 
behavior. Somewhat the same sort of behavior was found by (Elliott 
(1928) when the food in the goal-box of a rat maze experiment was 
changed from bran mash to sunflower seed. More systematic experiments 
have been carried out since with chimpanzees.: (There is little doubt 
that animals have some sort of precognition or expectancy of specific 
goal objects. Under those circumstances, other goal-objects produce signs 
of behavior disruption. Such behavior means that the sign-learning theory 
is appropriate; \it does not, of course, mean that other theories may not 
attempt to deduce the behavior from other principles. 

2. Place learning. Experiments on place learning are designed to 
show that the learner is not moving from start to goal according to a 
fixed sequence of movements, such as would be predicted from reinforce- 
ment theories, but is capable of behavior which is varied appropriately to 
changed conditions, as though he “knows” where the goal is, There are 
three subtypes of these experiments. 

The first su’ ‘ype of the place-learning alternative to response learn- 
ing leaves the form of the path intact but interferes with the movement 
sequences in getting from start to goal.\In one experiment, rats that had 
learned to run a maze straight on, after surgically produced cerebellar 


12 Cowles and Nissen (1937). 
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damage were unable to run the maze except in circles, but even so were 
able to run without error." They could not have been running off the 
earlier learned sequences of kinesthetic habits. In another, rats were able 
to demonstrate what they had learned by swimming the correct path 
after having been trained in wading it.** 

The second subtype of place-learning experiment sets a movement 
habit against a spatial habit and determines which is thé more readily 
learned. (Tolman and his collaborators?" arranged an elevated maze in 
the form of a cross, as shown in Figure 25(The response-learning group 
was started in random alternation from either_S, ‘or S, always finding 
food by turning to the right) That is, food was at E, when the start was 


S: 


x Black Curtain Black Curtain ae 


wn 


S, 


elative ea f learning either a response 
25. Maze used to test the relative ease o g er a respons 
ahi Cais m A or the place at which reward is found. By starting irregularly 


i 5 i yard a same food box each time, ‘one group of rats 
at S, and S:, but finding reward at the i jne group of ra 


always_at the same Tocation. Thése 
t ight, now tatl left, to find food always_at the < ogag s 
urns now to the right Oring a either S: or”S:, but always finding food as a 


rou Pr eee ne way (right or left), another group is taught always to make 

the same res a but to find food at different places, depending upon the starting 

point, Place Tinig under the conditions of the experiment, is found to be easier 
. Pla a A 


than response learning. From Tolman, Ritchie, and Kalish (1946), page 223. 


yr. 
Y The place-learning group, by con- 
S, and at F, when the start was S.. f r p, by 
1 z o the same place for-food. This meant that if running 
nt to the same_ 


trast, always went t “ 
to Fy, a right turn would be required when starting from S, and a left 


turn when starting from S.. The place-learning group was much the more 
successful.) The eight rats of the Jace-learni roup all learned within 
f als were without error. None of the eight 
jals were 5 


8 trials, so that the next 10 tri i ; 
rats of the response-learning group learned this quickly, and five of them 
did not reach the criterion in 72 trials. Under the circumstances of an 


13 Lashley and Ball _— 
14 Macfarlane (1930). , 
= Tolman, Ritchie, and Kalish (1946) (1947). 
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elevated maze with many extra-maze cues, it is clearly demonstrated 
that place learning is simpler than response learning.** 

The third subtype of place-learning experiment involves the use of 
alternative paths when a practiced path is blocked. An early form of the 
blocked-path experiment is that of Tolman and Honzik (1932a) which is 
said to demonstrate inferential expectation, or insight. The main features 
of the arrangement are as follows. There are three paths (1, 2, and 8), 


FOOD BOX 


STARTING PLACE 


FIGURE 26. Maze used to test insight in rats. The paths become established as a 
hierarchy according to length, Path 1 preferred to Path 2, Path 2 to Path 3. If Path 1 
is closed by Block A, the rats run by Path 2. If Path 1 is closed by Block B, the rats 
run by Path 8 if they have “insight” that the barrier closes Path 2 as well as Path 1, 
From Honzik and Tolman (1930a), page 228. 


in that order of length from shortest to longest, and hence in that order 
of preference (Figure 26). In preliminary training, when path 1 was 
blocked, a preference was established between paths 2 and 8 for the 
shorter of these paths. Only when path 2 was blocked also, did they 
run by path 8. We may somewhat oversimplify by saying that a familiarity 
with all paths and a preference for them in the order 1, 2, 8 was estab- 


16 The importance of the extra-maze cues has been demonstrated by Blodgett and 
McCutchan (1947) (1948), who have other criticisms of the experiment cited. See 
the experimental reply, Ritchie, Aeschliman, and Peirce (1950). For other criticisms 
of space-learning experiments see Gentry, Brown, and Kaplan (1947). 
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lished in preliminary training. An important feature of the maze design, 
crucial for the test, was that paths 1 and 2 had a common segment Jead- 
ing to the goal. Previously the block in path 1 had been placed before 
this common path; then the rat, after backing up from the block, ran to 
path 2. Now in the test the block was placed farther along path 1, so that 
it fell in the common path. Would the rat in backing out again run by 
the second preference, path 2, and be frustrated, or would it “see” that 
path 2 was also blocked? What the rats did, predominantly, was to avoid 
path 2, and to take the path ordinarily least preferred, the long path 3, 
but the only one open. Again the hypothesis is supported that the rat 
acted in accordance with some sórt of “map” of the situation, and not ac- 
cording to blind habit, or according to the automatic running off of habits 


in hierarchical order.** 
One of the several later experiments using blocked paths also in- 


volved rotation of the starting table through 180° on test trials, as illus- 
trated in Figure 27." The original T-maze used in training is diagrammed 
at the left. From the table-top A the F, rats learned to turn left, in the di- 
rection of the light L,. They were rewarded by food at the end of the run, 
in the place signallized by the light. The F, rats learned to turn right, in 
the direction of the light La. The figure on the right shows how the table 
was moved from the test trials. It was not only moved, but provided with 
10 radiating paths, in addition to the original straight-ahead path, now 
blocked, You would predict from a naive place-theory that the rats, know- 
ing their way around, would take that path leading most directly toward 
the light where they had been fed. With rare exceptions they failed to 
choose the direct path. They did, however, show a direction reversal, the 
F, rats now predominantly turning right rather than left, the F, rats 
turning left rather than right. They responded to the rotation of the 
starting point all right, but they tended to choose the extreme paths 
running most di rectly toward the appropriate wall rather than directly 
to the place where they had been fed. While the results thus lend some 
support to a directional-orientation theory, they shöw that a zats cogni- 
tive map” (if it exists) does not correspond in any simple or direct man- 


ner to the rat’s experienced spatial reality. ) 
3. Latent Learning. In addition to the experiments on reward ex- 


ių experiment has been criticized by other experimenters 

wwe vs Tolmas pa E while reproducible, are ey disturbed by the 

mani rie 3 Pi erimental variables such as alley width whic would not be a 

anaipata a insight impossible. See Evans (1936), Harsh (1937), Keller and Hill 
(1986), Kuo (1937). 
18 Ritchie (1948). 
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pectancy and on space learning, a third variety of experiment bears im- 
portantly on sign learning. These are the latent-learning experiments 
to be described more fully later in this chapter. The latent-learning ex- 
periments show that an animal can learn by exploring the maze, with- 
out food reward, so that, when reward is later introduced, performance 
is better than that of rats without this exposure, and sometimes as good 
as that of rats with many previously rewarded trials. The “latent learn- 
ing” consists of knowledge of the maze, not revealed in choice of the 


[| Rat cages Rot cages 


FIGURE 27. Place orientation in the rat. The dotted figure on the right shows 
the T-maze used in training, the solid figure the rotated maze, with radiating paths, 
as used in testing, In testing, the direct path from the table-top to the original choice- 
point was blocked. After Ritchie (1948), page 664. 


shortest path from entrance to exit until the rat is motivated to make 
that choice. The experiments, beginning with those of Blodgett (1929), 
have been interpreted as critical of reinforcement explanations of learn- 
ing, although, as we have seen, the issue has been somewhat dulled by 
changes in Hull’s interpretation of reinforcement. 

In his own summarization of the experimental work supporting his 
theory, Tolman (1948b) lists two additional varieties as significant: the 
experiments on “hypotheses” in rats,” and the experiments on vicarious 
trial and error (VTE).*° The experiments on hypotheses led to the con- 
clusion that in a four-choice maze the animals adopted systematic modes 

19 Krechevsky (1932a). 


20 Muenzinger (1938), Tolman (1939), Geier, Levin, and Tolman (1941), Jackson 
(1943). 
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of solution, such a choice being somewhat tentative, so that one mode 
would be rejected for another. These experiments furnished the back- 
ground for a controversy over “continuity” and “discontinuity” in dis- 
crimination learning, the hypothesis experiments furnishing support for 
the “discontinuity” interpretation.*t Vicarious trial and error refers to 
the vacillation at a choice-point before the animal is “committed” to one 
or the other choice. This active comparing of stimuli appears to Tolman 
to support his viewpoint that perceptual or cognitive processes are promi- 
nent in controlling behavior at a choice point. 


Expectancy versus Habit 

Can a more precise formulation be made of the sign-learning theory 
—the theory to which the results of all of these experiments are said to 
conform? It has sometimes been characterized as a theory of “what leads 
to what,” a theory of signs, significates, and behavior routes. There are 
really two central problems in a theory of this kind: first, how is the 
expectation acquired that a given sign will lead to its significate, and, 
second, how is this expectation translated into action? We begin with 
the problem of acquiring the expectation. 

The theory is that with repeated experience of a sequence of events 
leading to a goal the probability is discriminated that the given behavior 
will eventuate in the expected end result. The result (for behavior) is 
faster running and blind-alley elimination, exactly as if a habit were 
being strengthened by reward, Hence some ingenuity is required to iso- 
late circumstances in which the expectancy interpretation is more ap- 
propriate than the habit-strengthening one. While the foregoing account 
of experiments gives qualitative support to x cognitive interpretation of 
learning, if expectancies conform to probabilities, a more quantitative 
approach should be possible. In a very stimulating paper Tolman and 
Brunswik (1935) proposed that the causal texture of the environment is 
not such as to permit firm expectations but that predictions must often 
be made on the basis of probabilities. In an experimental test of the 
proposal, Brunswik (1939) presented his rats with such a contingent 
environment, in which food might sometimes be found on the right, some- 
times on the left. He found that there was some measure of agreement 
between the choices of the rat and the probability that food would be 


where he went for it. 
An important supplement ; 
that the nature of the goal provi 


21 For a further discussion of this issue, see Cl 


ary finding in Brunswik’s experiment is 
des an emphasis that makes a great dif- 


hapter 12, 
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ference in the behavioral response to the discriminated probability. He 
found, for example, that rats rewarded on one side of a T-maze 100 per 
cent of their runs to that side, and on the other side in 50 per cent of 
their runs to that side, showed a preference for the two sides in roughly 
the 2:1 ratio of these reward schedules. This preference was tested dur- 
ing the last eight trials of training. During these trials, the rats ran to the 
100 per cent rewarded side in 72 per cent of the runs, and to the 50 per 
cent rewarded side in 28 per cent of the runs. If, however, the partial 
reward side was also “dangerous” on half the trials (a shock being used 
instead of non-reward), this cancelled the effect of the intermittent re- 
ward, and learning was equivalent to that in a conventional T-maze 
experiment in which one side only was rewarded. This, and related re- 
sults, led Brunswik to state: 


Discrimination of tae ae tends to increase with the rate of the proba- 
bility of emphasis on the two sides of the probability discrimination problem.?? 

Common experience amply justifies this supplementation to a sim- 
ple probability theory. One might walk into a store to try to make an 
unusual purchase even though the probability was only 1:10 that the 
store carried the merchandise sought. But one would not buy a ticket 
on an airline where the probability of a crash was 1:10. Discriminated 
probabilities are weighted before they emerge in action. 

In order to compare an expectancy theory with a reinforcement 
theory, Humphreys performed several experiments of the conditioned- 
response type. In human eyelid conditioning, it had already been shown 
that conditioned discrimination was more rapid when subjects knew 
which stimulus of a pair was to be positive, which negative, than when 
the probabilities had to be established through experience with the 
stimuli.2* The conjecture that the subjects 
to their expectations was a plausible one. Humphreys went on to show 
that random alternation of reinforcement and non-reinforcement led not 
only to as much conditioning as reinforcement every trial, but was fol- 
lowed also by greater resistance to extinction.: During extinction, re- 
sponses increased in frequency at first, and then fell off. This would be 
anticipated on an expectancy theory because the likelihood of reinforce- 
ment was great after a non-reinforcement during the body of the ex- 
periment, and greater still after two non-reinforcements because there 
were never more than two successive non-reinforcements during the 


were responding according 


22 Brunswik (1939), page 185. 
23 Hilgard, R. K. Campbell, and W, N. Sears (1988). 
24 Humphreys (1939a). 
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training sessions. But even after this high point, extinction was gradual. 
Humphreys conjectured that a shift from intermittent reinforcement to 
uniform non-reinforcement must have led with difficulty to the hypothe- 
sis of uniform non-reinforcement. Because the experiment was done with 
human subjects, a direct test was possible on the verbal level, and 
Humphreys designed and carried out a simple experiment which con- 
firmed his conjecture." 

For the study of verbal expectations, two lights were arranged on 
a board. When one of these lights was turned on, the subject was asked 
to guess whether or not it would be followed by the other light. Half 
the subjects were “trained” with the second light invariably following 
the first. They came gradually to guess in a high percentage of the cases 
that the first light would be followed by the second, in agreement with 
their experience. The other half had the second light turned on only in 
random alternation, so that half the time it did not appear. They guessed 
at chance level. Within the conditioning experiment, the intermittent- 
reinforcement group performed like the group uniformly reinforced; but 
in the verbal experiment, the groups behaved entirely unlike. Does this 
difference not invalidate the comparison? On the contrary, it is an essen- 
tial part of it, because it is necessary to distinguish between performance 
and expectation. In the conditioning experiment, uncertainty leads to 
conditioned responses as well as certainty because it is a “danger” situa- 
tion; blinking is as easy as refraining from blinking, and there is no 
penalty for an “erroneous” response, that is, for blinking to the condi- 
tioned stimulus alone. In the verbal experiment, response is more nearly 
pectation, because a false guess is subjectively 


rie 
representative of “pure ex : 
hich a false conditioned response 


interpreted as a mistake in a way m W 
is not. 


The crucial portion of the experiment is that in which extinction is 


simulated, that is, in which the first light is never again followed by the 
second. Humphreys’ results are plotted in F: igure 28, It is seen thay the 
group which had been trained on the every-trial reinforcement” quickly 
developed the hypothesis of uniform non-reinforcement, and) ceased to 
expect the second light. The group trained with intermittent “reinforce- 
ment” showed the rise in expectation which the theory demands, and 
the slow acceptance of the hypothesis that there would be no more sec- 


ond lights.?° 


25 Humphreys (1939b). 5 both within conditioning experi 
26 i ittent reinforcement, both withi & experiments 
T E a nis have proved to be of considerable theoretical interest, 
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A provisional expectancy is an hypothesis. When a situation is not 
yet structured so that path-goal relationships become clear, behavior in 
relationship to the situation may either be “random” or “systematic.” If 
it is systematic, it may be said to accord with an hypothesis. Score 
averages from groups of animals tend to conceal the differences between 
chance approaches and more orderly approaches, because one animal's 
responses cancel another’s unless they happen to correspond to the 


o o Following Intermittent 
Reinforcement (507) 

O= == © Following Regular 
Reinforcement (100 Z) 
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"EXTINCTION" TRIALS 


FIGURE 28. A phenomenon like extinction in an experiment on verbal expecta- 
tions. Prior to yielding the data in this figure, one group had regularly experienced 
a second light following a signal light, and had developed the expectation that a 
second light would regularly follow. This condition simulates 100 per cent reinforce- 
ment in a conditioning situation. For the second group, the signal light had been 
followed by the “expected” light only half the time. This condition is like 50 per cent 
reinforcement in conditioning, and leads to uncertainty as to whether or not the signal 
will be followed by a second light. Beginning with the trials shown in the figure, the 
signal light was never followed by the second light, thus simulating extinction, What 
is recorded is the per cent of subjects guessing on each trial that the second light 
would occur. Following regular reinforcement the change to the expectation of regu- 
lar non-reinforcement occurs more readily than following intermittent reinforcement. 
After Humphreys (1939b). 


experimenter’s plan as to what will be counted correct. By studying the 
presolution behavior of individual animals, systematic attempts at solu- 
tion can often be discovered, solutions of the order: “turn right,” “alter- 
nate,” “go to the dark.” These are not mere fixed preferences, because 
after one pattern has been used for a while, the animal learner may shift 


» a 


and the experimental literature is now sizable. For a useful summary, see Jenkins and 
Stanley (1950). On verbal experiments sımilar to Humphreys’, see Grant, Hake, and 
Hornseth (1951), Hake, Grant, and Hornseth (1951). 
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to another one. Krech named such systematic attempts at solution “hy- 
potheses,” and studied them under a number of conditions.** 

The full-fledged conception of expectancy thus includes at least the 
following aspects: (1) The organism brings to a problematic situation 
various systematic modes of attack, based largely on prior experiences. 
(2) The cognitive field is provisionally organized according to the 
hypotheses of the learner, the hypotheses which survive being those which 
best correspond with reality, that is, with the causal texture of the en- 
vironment. These hypotheses or expectancies are confirmed by successes 
in goal achievement. (3) A clearly established cognitive structure is 
available for use under altered conditions, as when a frequently used 


path is blocked.?$ 


Confirmation versus Reinforcement 

Because Tolman has been at such pains to protest against the law of 
effect and the principle of reinforcement as essential to learning, it is in 
order to inquire what role he assigns motivation in learning. 

One role assigned to motivation is in relation to performance. Learn- 
ing is used when drives are active. When drives are aroused, a state of 
tension ensues, leading to demands for goal-objects.** These tensions 
lead to activity, guided by the expectancies or cognitive structures avail- 
able. In this role, then, motivation is not really a factor in learning at 
all. It is related to performance, not to acquisition. Because this role of 
motivation is made much of in the criticisms of reinforcement theories, 
en that learning, the acquiring of cognitive struc- 


the impression is giv ae : isi 
of motivation. This is not true in fact, nor is it true 


tures, is independent 


in Tolman’s theory. 7 eens ate 
The second role of motivation is in the acquisition of cognitive 


structures. In one of the experiments critical of the aww of effect, the 
suggestion was made that what appeared to be the petneipls of effect 
might better be considered the influence of emphasis.*° That is, motiva- 
tion determines which features of the environment shall interest the 
learner, and those to which he pays attention. Such factors are influen- 
tial in perceptual acquisitions, which are the genuine substance of place 
learning. Causal sequences of the environment are not all spatial. Some 


3 1938a) (1988b). 
e Ktechevsky (19920) a OP aon of the attempts of others to systema- 


_°8 We postpone until Chapte 

“ee ae be noted with Hulls interpretation of drive as activating habit 
29 The para ) 

strength into reaction potential. s2 
30 Tolman, Hall, and Bretnall (1932). 
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are temporal; some are logical or systematic or arbitrary. The goal-object, 
by its presence or absence, verifies or refutes hypotheses. Hence the goal- 
object is essential for the establishment of some features of cognitive 
structure. Latent learning, which dramatizes a relatively unmotivated, 
incidental type of learning, is an extreme instance of the relationship of 
goal-behavior to learning and is probably not a typical case. 

The importance of something like emphasis is well brought out 
in the series of experiments by Muenzinger and his associates,*! in which 
punishment or obstacles to be overcome, even on the side of the correct 
choice, aid learning. Whether or not punishment will be an aid to learn- 
ing thus depends upon what influence it has on the cognitions of the 
learner. It may be helpful as an emphasizer or harmful as a distractor. 

The principle in Tolman’s system that most nearly replaces reinforce- 
ment is the principle of confirmation. If an expectancy is confirmed, its 
probability value is increased; if an expectancy is not confirmed, its 
probability value is decreased (i.e., it undergoes extinction). Tolman 
distinguishes between confirmation and reinforcement in a passage de- 
scribing the building up of preferences for one restaurant over another: 

Such learning takes place, I believe, not primarily through reinforcement 
in the Hullian sense but merely by the repeated “confirming” experiences of find- 
ing what restaurants lead to what foods by what costs and with what degrees of 
gratification,” 

The regulatory role of reward is not very different in Hull’s system 
from this role in Tolman’s, though the interpretations differ. According to 
Tolman, the learner acquires knowledge of a means-end relationship; 
according to reinforcement theory, he learns behavior which is followed 
by reward. In any case, it would be most misleading to interpret Tol- 
man’s position as if reward influenced performance only, and not learn- 
ing. All that he insists upon is that learning and performance cannot be 
equated, and the role of motivation in learning differs in some respects 


from its role in performance. This conclusion, originally proposed by 
Lashley (1929), can scarcely be doubted. 


.</VARIETIES OF LEARNING AND THEIR LAWS 
The 1982 Version 


Learning theorists commonly select one kind of learning problem 
or situation as typical and then proceed to develop a theory appropriate 
to this reference situation. Having constructed a set of principles in this 


31 Muenzinger and others, 1934-1952. 
32 Tolman (1952b), page 396. 
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way, they attempt to show by a logical process that other kinds of learn- 
ing are really at base like the typical one, and hence explicable in the same 
terms. Recognizing this tendency, (T olman selected for review three kinds 
of learning experiments, with their three corresponding doctrines. These 
were conditioned-reflex learning (Pavlov), trial-and-error learning 
(Thorndike), and inventive learning (Kéhler). He then gave a sign- 
gestalt interpretation of each of the three,kinds of learning as alternative 
to the usual theory associated with each. He found it useful to preserve 
the typical experiments, which represent a kind of hierarchy from stu- 
pidity to intelligence. The laws applicable to the more stupid situations 
have to be supplemented by additional laws for the higher forms of 
learning. 

In the 1932 version there are three groups of laws: capacity laws, 
laws relative to the nature of the material, and laws relative to the man- 
ner of presentation. 

®© Capacity laws. Only organisms can learn. It is evident, therefore, 
that what the organism can learn must depend on what kindof an or- 
ganism it is. That is the reason for capacity laws. 

~ The list of capacity laws is as follows: * 


a. Formal means-end-capacities _ re 
b. Discriminating and manipulating capacities 


c. Retentivity 
d. Means-end ci 
. Ideational capacities 
f. Creative instability 
arn conditioned reflexes, the learner must have the 


rm and act in accordance with “sign-gestalt- 
expectations.” That is, the conditioned stimulus serves as a sign that 
the unconditioned stimulus is about to appear and the conditioned be- 
havior is appropriate to the sign. This capacity is named a “means-end- 
Capacity » In his later writings Tolman has dropped a number of the 


hyphenated terms which made his book clear, entertaining, but also 
list of laws which we have given above, his 


somewhat forbidding. In the 
terms have been freely paraphrased to make for easier reading, although 
there may be some loss involved. In addition to the general capacity for 


Sign learning conditioning requires special capacities for discriminating 
and manipulating features of the environment. Finally retentivity is im- 
plied, if the results of earlier conditioning trials are to influence later 


ones. Only capacity laws 4, b, and c apply to conditioning. 
Í The capacities needed for trial-and-error learning are the same as 


-capacities needed for alternative routes, detours, etc. 


2 


In order to le 
necessary capacities to fo 


33 Tolman (1932b), pages 376-377. 
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those required for conditioning, except that additional means-end- 
capacities are needed because more alternatives are open to the learner. 
The field relationships of alternate routes, detours, final common paths, 
are involved. Additional capacities of ideational sort, permitting compari- 
son of alternatives (a mental “running-back-and-forth”), are probably 
helpful in trial-and-error learning. 

(Inventive learning requires all the capacities of the other varieties of 
learning plus creative instability) This is a capacity to break out into 
new lines of activity which have never occurred to the learner before. | 

The need for capacity laws seems evident enough, once they are pro- 
posed, though they have been neglected in most learning theories. Even 
Thorndike, strongly identified with the study of individual differences, 
neglects capacity laws in his learning theory. Such a statement as “Any 
response of which the organism is capable can become attached to any 
stimulus to which it is sensitive” * implies only sensitivity capacity and 
response capacity, and neglects any capacity to establish relations be- 
tween them. It would be grossly unfair to say that Thorndike did not 
recognize differences in learning ability, but it is true that he slighted 
the different kinds of capacities needed for different kinds of learning, 
because all learning was merely the forming of bonds. Hull, who also 
in one of his earlier research interests contributed to the psychology of 
individual differences,®* only late in his career began to consider indi- 
viduality as something to enter into his learning theory.** 
~/ 2. Laws relating to the nature of the material. In Tolman’s discus- 
sion of these topics, he calls attention to certain “gestalt-inducing- 
conditions” and suggests that they are of the sorts emphasized in gestalt 
studies of perception. 

The list follows: * 

a. Togetherness 

b. Fusibility 

c. Other gestalt-like laws 


d. Interrelations among the spatial, temporal 
alternatives 


e. Characters in the material favoring new closures and expansions of the 
field 


and other characters of the 


In relation to conditioned-reflex learning, these laws suggest that 
there must be a “togetherness” of essential signs and their means-end- 
relationship to the thing signified. Tolman states that this is about what 


34 Thorndike (1913b), page 15. 

35 Hull (1928). 

36 Hull (1945). 

37 Tolman (1932b), pages 878-385. 
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Thorndike has called “belongingness.” Tolman adds a somewhat similar 
law of fusibility of sign, significate, and signified means-end-relationship 
by which he means a certain naturalness about the situation which makes 
it easier to form a gestalt of the whole} He provides in a third law for the 
possibility of new discoveries. He adds one law (d) for trial-and-error 
re learning (e), to suggest that some arrange- 


learning and one for inventiv 
vi ay PACARAN 
ments must be easier than others. He points to Köhler’s observation that 


the ape could learn to use the stick more easily to rake in the food if 


the stick and food were perceived together. 

These laws show great catholicity, but there is no ordering principle 
among them. “Spatial, temporal, and other” can scarcely be said to ar- 
range things dimensionally. They do make a definite bow to the important 
fact that perceptual principles must be understood if the relative ease 
or difficulty of problematic situations is to be made comprehensible. 

3. Laws relative to the manner of presentation. These are the laws 


inherited largely from association psychology, the ones for which abun- 


dant evidence can be found recorded in McGeoch and Irion’s book 


(1952). The list is as follows: ° 


a, Frequency, recency , 
b. Revival after extinction, 
S Motivation nais 
d. Not “effect” but “emphasis , 

ot enee E s in the presentation of alternatives 


e. Temporal orders and sequence: S i 
f. Temporal relations between the presentation of certain of the already 


given alternatives and the true solution 

Of these, the first four belong to conditioned-reflex learning, the 
first five to trial-and-error learning, and all six to inventive learning. 
equency and recency are accepted in the follow- 
quently and more recently the actual sequence 
and significate have been presented, the 
ial, this resulting sign-gestalt will tend 
favorable to sign-gestalt formation 
laws provide opportunity again 
and to make some observations 
and-error and multiple-choice 


primacy, distributed repetition, ete. 


. The principles of fr 
ing form: “The more fre 
of sign, means-end-relation 
stronger, other things being ©d" 
to be,” 9 That is, only in a situation 
will frequency be effective. The other 
to raise the question of the law of effect, 
descriptive of favorable conditions in trial- 
experiments. 

As a set, the laws 
reminder of the main tenet: 
Prevailing doctrines. They 


sappointing. They serve as a useful 
t of view and of its criticisms of 


to be asked for on the positive 


are rather di 
s of the poin 
leave much 


28 Tolman (1982b), pages 385-889- 
39 Tolman (1932b), page 386- 
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side in their lack of sufficient precision of statement so that they can be 
called true or false. 

Leaving the laws in this form makes everything a matter of cor- 
relations between situation and behavior and does not get at the formal 
problem of rigorous definition and measurement of intervening variables. 
This lack has not yet been made up, in spite of later reworkings of the 
list of laws. 


The 1949 Version 


( New sets of laws have been proposed by Tolman from time to time.*° 
The “laws” do not have the centrality within his system that Hull’s postu- 
lates have within his, so the revision of the list of laws requires little re- 
interpretation of earlier experimentation. For purposes of understanding 
the restatement of the theory, however, it is useful to review one of these 
later revisions. The one selected for review appeared 17 years after the 
first.** 

In this later paper, (Tolman distinguished six “types of connections 
or relations’) that get learned, and then proceeded to indicate what was 
known about the “laws” applying to each. 

WMH Cathexes.\Tolman borrowed the word “cathexis” (plural, cathexes ) 
from psychoanalytic theory, where it refers metaphorically to the energy 
charge with which an activity is invested. He gives it a related but more 
restricted meaning, almost equivalent to Lewin’s valence. A cathexis is 
an acquired relationship between a drive and an object.) If a hungry 
child wants a banana, you know that the child has a learned cathexis 
between its hunger drive and a banana as a preferred goal-object. This 
is a positive cathexis, for the child is led to seek bananas. A negative 
cathexis works the other way. If a burned child fears a hot stove his 
fear is connected, as a negative cathexis, to the stove, and the cathexis is 
negative because he shuns the stove.) _~ 

Tolman believes that drive reduction (reinforcement) is probably, 
valid as an explanation for the acquiring of cathexes, but he believes we 
need more experimental evidence. 

Both positive and negative cathexes, he believes, are very resistant 
to forgetting. 


ance, in the case of negative cathexes),as it would in the presence of 
8 


40 Tolman (1934) (1937) (1949a). 
41 Tolman (1949a). 
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the goal or actual disturbance. In social behavior, and especially in the 
psychological clinic, we see many evidences of faulty equivalence be- 
liefs. These are often engendered by traumatic experiences. Hence Tol- 
man concludes that the reinforcement principle is valid for equivalence 
beliefs as for cathexes, but it needs to be supplemented by an awareness 
of the importance of traumatic experience. 

48. Field expectancies. These are the rechristened sign-gestalt- 
expectations of the earlier system. (They permit the organism to take 
shortcuts, roundabout routes, and make possible latent learning) Tolman 
reasserts his contention that reinforcement per se is not valid in the ex- 
planation of field expectancies, although he acknowledges the importance 


of motivation. 


The presence of reinforcement in a particular locus makes the locus a goal 
which determines what performance will take place but it does not stamp in S-R 
connections though it probably does give a special vividness to that locus in the 


total field expectancy.** 
ttle at present about laws of the formation of 


Although we know li 
e do work them out we shall have to take into 


field expectancies, when w 


account perceptual sensitivity, memory, and inference abilities. 
The forgetting of field expectancies is probably more rapid than the 


forgetting of cathexes and equivalence beliefs. Such forgetting, Tolman 
believes, probably follows the sorts of laws the gestalt psychologists have 
uncovered, 
vA, Field-cognition modes. These modes are of higher order than the 
field expectancies. A field-cognition mode makes a: given expectancy- 
formation possible. The mode is based in part upon innate capacities and 
in part on previously established field expectancies. / Hence (a field- 
Cognition mode represents a disposition or readiness to acquire field ex- 
pectancies of one of three varieties: perceptual, memorial, or inferential. 
Tolman has no laws to suggest. Such laws will arise out of at present 
Non-exjstent types of transfer experiment. . 
\ / Drive discriminations. Tolman here refers to experiments such as 
those of Hull (1933) and Leeper (1935), in which the animal learned 
to go one way when hungry, another way when thirsty. Tolman has no 
laws to suggest. ` URF ) 
6. Motor patterns. Because his behaviorism is not of the stimulus- 
response variety, Tolman recognizes that he needs some special principles 
governing the acquisition of motor patterns. For these, Tolman was in 


1949 ning the aega Guts simple conditioning explanation. 


42 Tolman (1949a), page 151- 
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It is evident that these laws have not moved toward precision over 
the initial set. Tolman states his own estimate of what he has done here: 


J 


pa . 


And, although, as usual, I have been merely programmatic and have not 
attempted to set up, at this date, any precise system of postulates and deduced 
theorems, I have made some specific suggestions as to some of the conditions and 
laws for the acquisition, de-acquisition, and forgetting of these relationships."* 

While Tolman has made very thoughtful observations on learning, 

and has been responsible for much original experimentation, he_has not 
proposed a system clearly enough defined so that successive approxi- 
mations succeeded in making it more exact and testable. We find, in 
this statement of six kinds of learned relationships, two explained by re- 
inforcement (Hull, Skinner), one explained by simple conditioning 
(Guthrie), and three not explained at all. This is characteristic of Tol- 
man’s role. In addition to making concessions to other interpretations, 
he has consistently pointed out relationships difficult for stimulus- 
response theorists to explain, without himself being able to offer much 
in the way of detailed explanation for these discovered relationships. 


EXPERIMENTS’ON LATENT LEARNING 
“i 


Ss 4 Reinforcement theory proposes that habits are strengthened by rein- 

forcement; sign-learning theory says that expectancies become estab- 
lished, and that performance is secondarily correlated with expectancy. 
(Experiments on latent learning have been used all along to argue against 
reinforcement as an essential principle of learning." )The arguments 
against reinforcement theory had more cogency before Hull modified 
his theory to minimize the amount of necessary reinforcement and to em- 
phasize the factors activating habit into performance. All that can be 
said with confidence, now that Hull has quoted the original latent- 
learning experiments as evidence in support of his predictions, is that 
the details of latent-learning experiments place reinforcement theory 
under strain. But it is equally true that the details of latent-learning 
experiments likewise place cognitive theories under strain, for an ade- 
quate theory (whether a reinforcement theory or a sign-learning theory) 
has to account for both the positive and the negative results with 
which the literature abounds. \Because the latent-learning experiment 
has proved such a fertile area for testing theories, it is worth examining 
the evidence in some detail. 


43 Tolman (1949a), page 154. B 
44 E.g, Tolman (1938b); Thistlethwaite (1951b). 
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Tolman and Honzik’s Experiment ï 

Following up the work started by Blodgett (1929), (Colman and 
Honzik (1930b) studied the effect of introduction of reward in a rat- 
maze experiment after the animals had run several days without food. 
The control group, fed each day in the maze, reduced their error and 
time scores much more rapidly than the non-fed group, but when food 
was introduced for the latter group, error scores and time scores abruptly 
became alike for both groups. Thus the non-fed group had apparently 
profited as much by its earlier trials as the fed group. Since this profiting 
did not show in performance, the learning taking place is said to be 
“latent.’) The results for error elimination are shown in Figure 29. 
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learning in the maze. {Vith no food reward there 

is some reduction in errors, but not as great a reduction as with regular food reward. 


Despite the higher error scores prior to the introduction of food, the group rewarded 


onl. ial immediately begins to do as well as the group that had been 
y from the 11th trial i y that some learning went on withi st 


regularl . The interpretation is 3 : | 
ie ay remade EFE in performance until the food_incentive activated it. 


After ‘Tolman and Honzik (1980b). 


There were three groups: two control groups, one rewarded through- 
out, the other non-rewarded_throughout, and the experimental group, 
non-rewarded until the eleventh trial. On the twelfth day, the experi- 
mental group, Having been fed but once in the maze, made as few 
errors as the group which had received food in the maze each of the pre- 


ceding days. 
There are severa 


12345 


FIGURE 29. Evidence for latent 


1 comments to be made on this situation, in its bear- 
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ing on other theories as well as Tolman’s. The maze used was a 14-unit 
one of multiple-T type,*® arranged with doors between each unit to pre- 
vent retracing. The rat without food at the end still progressed through 
the maze, so that, according to such a theory as Guthrie’s, learning con- 
ditions were ideal. The rat had no opportunity to unlearn what it last 
did in each segment, which was of course to go through the door to the 
next one. The results are critical of Guthrie’s theory not because the rat 
shows latent learning but because it does not show enough learning 
when there is no food at the end of the maze. (It is to be recalled that 
the Guthrie and Horton cats often did not eat when they left the prob- 
lem-box; their behavior was said to be fixed because they left the box.) 
The Guthrie explanation is not sufficient, because the fed rats learned 
much better than the unfed ones. 

Reinforcement theories, in their older form, were put to it to ex- 
plain latent learning. In the older forms of “law of effect” theory, the 
presumption was that reinforcement worked directly upon response 
strength, so that all that was learned should be revealed in performance. 
Reinforcement theorists usually *made much of ‘the decrease in errors 
during non-rewarded trials in latent-learning experiments as evidence that 
some reinforcement was present before reward was-introduced. But they 
could really only account for that much gain by reinforcement, any pre- 
viously concealed gain, shown when reward was introduced, bein 
unaccounted for. In Hull’s later theory, however, in which habit strength 
(sHr) is acquired independently of the magnitude of reinforcement, this 
difficulty no longer holds, Under minimum values of incentive magni- 
tude K (such incentives aś are present without food reward, including ex- 
ploration, return to home cage, etc.) habit strength will be practically 
“latent,” to be revealed in reaction potential when large values of K’ 
are created by introducing the food reward. 

Tolman’s explanation is that the non-reward situation was a good one 
for learning the spatial relations of the maze. Every unit had one dead 
end and another end with a door. The last thing done in each case was 
to go through the door. Recency (which is accepted as favoring sign- 
gestalts ) would strengthen the cognition that the door was the way from 
one segment to the next, though under non-reward conditions there 

were no reasons for the rat to show what it “knew.” The substitution of 
the food at the end of the maze, a highly demanded goal-object, led 
the rat to use its cognitive map, to take the turns which led from one unit 
to the next. Hence the sudden reduction in errors, That the group with 


45 Stone and Nyswander (1927). 


TOLMAN’S SIGN LEARNING 211 


late introduction of reward had a tendency to do even better than the 
regularly rewarded group may conceivably have been due to a more 
thorough exploration of the maze in the early trials by the non-rewarded 
group, with consequent better orientation when finally running toward 
the goal. 


E Five Varieties of Latent-Learning Experiment and Their 
Results \ 


An upsurge of interest in latent learning led to many experimental 
reports, of which 48 have been classified by MacCorquodale and Meehl 
into five categories.*® Their classification is followed here, as well as 
their scoring of the experiments as yielding positive or negative results. ” 

l. Unrewarded trials with later introduction of relevant reward.) 
This is the type of experiment that Blodgett performed and Tolman and 
Honzik followed up. Of nine studies cited, seven are interpreted as 
positive, two as negative.“ 

| 2. Free exploration followed by relevant reward. The animal is 
permitted to run around in the maze for some time. Then it is either 
fed in the goal-box prior to the test runs, or runs the maze as usual to 
food at the end, In either arrangement a control group is necessary for 
comparison. Of five experiments of this kind, all were positive.** ) 

It has been reported that in free exploration animals tend to reduce 
their entrance into culs, thus producing latent learning through a kind 
of “satiation” of blind-alley entrances.‘* But the problem turns out not 
to be that simple, for more recent workers have found the most success- 
ful latent learners to be the ones that made the largest number of cul 
entries during exploration. They did not decrease their cul entrances 
during exploration." 

(8. Location of incentives learned under satiation and tested under 
relevant drive.sIn this third variety of experiment, animals satiated for 


4), pages 199-213. The last of the latent learning 
experiments cited is dated 1952. These writers leaned heavily upon a review by 
Thistlethwaite (1951a), both for earlier experimental citations and for the major classi- 
leatory scheme. Thistlethwaite’s review led to two replies, Kendler (1952) and Maltz- 
man (1952), to which he responded, Thistlethwaite (1952a). ; 

47 Positive results: Blodgett (1929), Elliott (1929), Herb (1940), Simmons (1924), 
Tolman and Honzik (1930b), Wallace, Blackwell, and Jenkins (1941), and Williams 
(1929), Negative results: Mechl and MacCorquodale (1951), Reynolds (1945b), 

48 Positive results: Buxton (1940), Daub (1933), Haney (1931), Karn and Porter 
(1946), Lashley (1918). 


49 MacC Jale and Meehl (1951). 
30 Reheat Weaver (1953). Unfortunately, they used different alley 


Widths and younger rats than MacCorquodale and Meehl. 


48 MacCorquodale and Meehl (195 
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food and water run a T-maze or a Y-maze with food and water in oppo- 
site arms, or in one arm. Then, under test conditions, the “hungry” or 
“thirsty” animal demonstrates latent learning if it can choose the appro- 
priate turn to reach the relevant incentive. Of nine reports of this variety, 
seven yielded positive results, two negative results." 

To explain this kind of learning, the reinforcement theorist falls back 
upon Hull's fractional antedating goal response (re). This is, of course, 
the reinforcement theorist’s equivalent of “cognition.” * The issue, when 
a reinforcement theorist invokes rg, is not over whether or not the “ex- 
pectancy” language is appropriate, for such language is always appro- 
priate when rgs are involved. The issue is over the scientific precision 
and fertility of the “expectancy” interpretation as contrasted with the 
rg one. Because rg’s have the dimensions of ordinary responses they can 
be treated with a quantitative precision. Because cognitions lack such 
dimensions, similar treatment is not feasible. Critics of the Tq concept 
do not deny the possibility of precision, but they believe that pre- 
cision in the use of rg has not been achieved, and that some of the sug- 
gested uses of ře are faulty. For example, if there are cases in which 
palpable rgs do not exist, then the rg theory is insufficient. 

4. Location of incentives learned under strong irrelevant (com- 
peting) drives. If food is present in one arm of a T-maze and water in 
the other, a thirsty animal, through a forced-run procedure, sometimes 
encounters the dry food, without eating it, and sometimes encounters the 
water, and drinks it:\Under these arrangements, will the animal, if made 
hungry, run to the side where food had been encountered? This is the 
only variety of experiment in which the ne 
numbers the positive ones. Of eighteen studi 
and eleven are negative." `} 


These experiments point to complexities that call for detailed analy- 


51 Positive results: MacCorquodale and Mechl (1951), Meehl and MacCor ale 
(1948), Seward, Levy, and Handlon (1950), Spence, Bergmann, and Lippitt Cisco 
Spence and Lippitt (1940), Szymanski (1918), Thistlethwaite (1951b). Negative re- 
sults: Kendler (1947a), Maltzman (1950), Kendler and Levine (1953) have added 
some positive results. 

52 A defense of the ra concept in the interpretation of irrelevant motivation-incentive 
experiments can be found in Spence, Bergmann, and Lippitt (1950), Kendler, Karasik, 
and Schrier (1954). Some difficulties with the re concept are pointed out by Thistle- 
thwaite (1951b). See also Heron (1949). 

58 Positive results: Bendig (1952), Christie (1952), Diesenroth and Spence (1941), 
Strange (1950), Thistlethwaite (1952b), Walker (1951), Walker, Knotter, and De- 
Valois (1950). Negative results: Christie (1951), Fehrer (1951), Gleitman (1950), 
Grice (1940), Kendler (1947b), Kendler and Mencher (1948), Kendler and Kanner 
(1950), Littman (1950), Shaw and Waters (1950), Spence and Lippitt (1946), 
Walker (1948). 


gative results outweigh in 
es, only seven are positive, 
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sis of what is happening. There is an asymmetry, for example, between a 
hunger-food sequence and a thirst-water sequence. One might suppose 
that a hungry and thirsty animal would be more likely to drink water than 
to eat dry food. Yet Heron (1949) found that hungry and thirsty animals 
chose the food side in a familiar apparatus in 100 per cent of their choices, 
where the other side led to water! Animals that were satiated, that is, 
that had all they would eat or drink, chose the water side in 92 per cent 
of the trials. Heron’s conjecture that perhaps the choice was made on the 
basis of full or empty stomach, rather than on the basis of drive discrimi- 
nation, was supported by the choices of animals whose stomachs were full 


of water. They, like the food-and-water satiated animals, chose the water 


side, insofar as they made any choices at all. 
Another complexity is introduced because under strong drive some of 


the animals have to be forced down the alley leading to the irrelevant 
incentive. Walker, Knotter, and DeValois (1950) showed that animals 
that had to be forced demonstrated latent learning less frequently than 


animals not forced. They showed also that animals with strong position 


preference during the early runs were less likely to show latent learning in 


the test runs. 
When all of these complexities are unravelled, the focus of the con- 


well change, just as it has already changed 


troversy over theory may very ; 
earning to the circumstances under which 


from the existence of latent | 


latent learning occurs. ae 
5. Exploration of paths leading to distinctive goal-boxes, followed 


by prefeeding in one goal-box prior to the test run. This design differs 
from the second type (exploration of a maze) in that discriminable goal- 
boxes are used) It is in reality a variant of Maier’s (1929) reasoning ex- 
periment, in which many of the same arrangements are involved. Hence 
Maier’s experiment, and the successful repetitions of it, give some addi- 
The typical experiment with this design uses 
distinguishable end-boxes. After the rat is 
familiar with the arms of the maze and the end-boxes to which these lead, 
it is fed in one of the end-boxes. It demonstrates latent learning if, when 
placed at the entrance, it chooses the side leading to the appropriate 
goal-box. Of seven studies of this kind, four are classified as positive, 


three as negative.” 


What conclusions can 

53 tv, - Gilchrist (1952), Iwahara and Marx (1950), Seward (1949), 
regen euls Gilen Negative results: Denny and Davis (1951), Seward, 
Datel, ad I ey (1952) (In the last study, one of three replications came out posi- 


tive.) 


tional supporting evidence. 
a T-maze with two easily 


we draw from this array of contradictory 
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evidence? So far as sheer “score” is concerned, of the studies reviewed 
by MacCorquodale and Meehl, thirty were reported as positive, eighteen 
as negative. There is no longer any doubt but that, under appropriate 
circumstances, latent learning is demonstrable. 


Other Varieties of Latent-Learning Experiment 


Because the possibilities of learning under irrelevant motivation- 
incentive conditions are enormous, the five foregoing experimental varie- 
ties are not exhaustive. 

An ingenious and novel test of latent learning has been proposed by 
Muenzinger and Conrad (1954). They suggest that latent learning can 
be tested by its interference with a novel task as well as by its accelera- 
tion of learning in the original task. The interference (“negative trans- 
fer”) method permits a distinction between two phases of latent learning. 
In the first phase the non-rewarded animal becomes “maze-wise” without 
learning the true path. If in this phase the animal is moved to a new 
maze that is the mirror-image of the first, and run with reward, he shows 
positive transfer, that is, learns more successfully than control rats with- 
out the prior non-rewarded experience. In the second phase, “true-path” 
learning occurs, and produces negative transfer when the rat is run in 
the mirror-image maze with reward. This second phase demonstrates 
genuine latent learning of the true path. In the reported experiments the 
first phase lasted for 10 trials (five on each of two days), and the last 
phase began only on the third non-rewarded day. If the reward is intro- 
duced earlier (on the first or second day) true-path learning sets in 
earlier, and negative transfer is demonstrable in the mirror-image maze," 
These findings suggest several important supplementations to the previous 
interpretation of latent learning—both as to what is happening in the non- 
rewarded runs and what happens when the reward is introduced, 

There has been a long history of experimentation on “incidental 
learning” in human subjects, that is, on the learning of something tan- 
gentially related to an ongoing task upon which attention is directed. 
These experiments have recently been assimilated to latent learning, for 
they are appropriately classified as learning under low degrees of rele- 
vant motivation, such learning being followed by more direct testing. For 
example, Stevenson (1954) finds striking increase in latent-learnin 
ability with age during the preschool years. Postman and Thuma (1954) 


55 Kanner (1954) also shows that rewarded animals learn somewhat more than un- 
rewarded ones. 
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distinguish between the subject's recognition of the presence of the inci- 
dental stimuli and his recall of their location. 
Following their review of the latent-learning literature, MacCor- 


quodale and Meehl conclude: 


In spite of the preceding difficulties of interpretation, it seems safe to say 
that the current state of the evidence is at least encouraging to the theorist 
oriented to some form of expectancy theory. We were, frankly, somewhat more 
impressed by the overall trend of the evidence than we had expected to be.** 

They have suggestions for both the major competing theories. First, 
they say, cognitive theorists have rested too often with attempts to con- 
found the opposition by posing the question “How do you explain that 
one?” The cognition theorist must take seriously the accounting for nega- 
tive results as well as taking satisfaction in the positive results. Second, 


reinforcement theorists have tended to use an “elastic” explanation based 


on ra. They must use the Tc concept rigorously if they wish to assert the 
superiority of their type of explanation. 


' (ESTIMATE OF TOLMAN’S SIGN LEARNING ) 


(Tolman’s Position on the Typical Problems of Learning | 


a system with some aspirations toward completeness, 


Because it is ae 
writings relevant to most of the 


there are statements within Tolman’s 


problems raised by other writers. 
1. Capacity. Tolman has recognized the need for capacity laws. The 


matter interests him chiefly because of the possible gradation of learn- 
ing tasks from those requiring least to those requiring most intelligence. | 
It is natural that one who makes predictions about what animals will do. 
in problem-solving situations is confronted with the limitations of one 
organism as compared with another. [olman believes that the high de- 
gree of specificity of capacities in the rat is due to the lack of influence of 
a culture which prizes certain behaviors over others. Hence one of the 
contributions of animal studies may þe to show processes at a subcultural 
level,57 : 

( a2 Practice. The law of exercise is accepted in the sense of the 
frequency with which the sign, the significate, aid! the: peon fe 


tion between the two, have been presented Exercise is not the cause of 


the initial selection of the right response. Mere frequency without “be- 


56 MacCorquodale and Meehl (1954), page 218. 
57 Tolman (1945). 


AE ad 
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longing” does not establish a connection. After a response has been 
learned, overexercise tends to fix it, making it unduly resistant to 
change.** 

fd. Motivation. Rewards and punishment tend to regulate perform- 
ance, rather than acquisition, although they are related to acquisition 
also because they serve as “emphasizers” and because goal-objects con- 
firm or refute hypotheses. /Because of the demonstration of latent learn- 
ing, the law of effect in its usual sense (reward as a strengthener of 
response tendencies) is not accepted. 

4, Understanding.(Cognitive processes are of the very essence of 
molar behavior and learning. Hence Tolman is friendly to learning by 
creative inference, inventive ideation, and so on.)He repeatedly states, 
however, that he does not wish to imply “introspectively get-at-able 
conscious contents.” The prototype of learning is sensible, reasonable ad- 
justment according to the requirements of the situation; stupid learning 
occurs as a limiting case when the problem is unsuited to the learner’s 
capacities or is set up in inaccessible form.(Insightful learning is not 
limited to the primates; it is characteristic of rat behavior as well. 

5. Transfer. The problem of transfer of training as such has been 
of relatively slight interest to those experimenting with animals. To 
some extent all the experiments on change of reward, change of drive, 
place learning, and latent learning are experiments on problems related 
to transfer, that is, the ability to use something learned in one situation 
in relation to another.’ All cognitive theories expect a large measure of 
transfer, provided the essential relationships of the situation are open 
to the observation of the learner. 

16. Forgetting. Having earlier experimented in the field of retroac- 
tive inhibition,” it is probable that Tolman is friendly toward some theory 
of retroactive inhibition, and he has indicated that he accepts the 
Freudian mechanism of repressiori.°° } 

He has also asserted the resistance of cathexes and equivalence be- 
liefs to forgetting, and the susceptibility of field expectations to the 
kind of forgetting emphasized by gestalt psychologists. 


Molar Behavior as a Field Concept 
The student of molar behavior contends that there can be a psy- 


chological science or behavioral science in its own right, not waiting for 


58 Krechevsky and Honzik (1932). 
59 Tolman (1917). 
60 Tolman (1942), pages 63-64. 
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its progress upon advances in other sciences. While there is some con- 
troversy about this, it is a position which can be defended by both 
stimulus-response and cognition psychologists. Hull too, was able to 
adopt a concept of molar behavior by this definition. 

Beyond the matter of level of discourse there lies the question as 
to whether molar behavior is essentially a field concept. The question 
might be put another way. To what extent is Tolman’s sign-gestalt theory 
a true gestalt psychology? Koftka, in his review of Tolman’s book, wel- 
comed the friendliness to gestalt, but deplored some limitations which 
he detected in Tolman’s variety of gestalt." He believed that the distinc- 
tion between molar and molecular did not go far enough, because the 
reality of the molecular was acknowledged. Koffka would wish field 
principles applied to physiology as well as to psychology, so that all 
explanatory concepts should be molar. Furthermore, Tolman s preoccupa- 
tion with behavior sequences and historical interpretations makes sign- 
gestalts only a limited illustration of the variety of possible gestalts. 

This disadvantage, from Koffka’s point of view, makes it easier to 
compare Tolman’s experiments with those done under the influence of 
other theories. Since the experiments are done with rats in mazes and 
discrimination situations, they are directly comparable to the experi- 
ments in the typical American animal laboratory. Therefore Tolman chal- 
lenges prevailing conceptions such as those of Thorndike and Guthrie 
and Hull more vividly than the more orthodox gestalt writers, whose 
situations are often so different as to be incommensurate. 

Tolman makes so many generous acknowledgments to Lewin that 
there is no doubt of the affiliations between their systems, as third parties 


have made clear. Their differences became adjusted in a brief and 


friendly controversy. mi i ili 

The conception of molar behavior is consonant > n pre — ae? 
ceptions of science, in which some degree of arbitrariness is recognized 
le from the totality of natural events for 


in the abstractions to be mac 
the purposes of any given science. Natural laws can be formulated in 
a 


many different ways to cover asp Kias A 
doubtedly done a service to psychology in joining with those who see 
the importance of a “psychological” psychology, that is, one whose con- 
cepts are appropriate to the level of its descriptions and predictions. 


ects of occurrences. Tolman has un- 


ê Koftka (1933). 

2 E.g., White (1943). 

6s Tolman (1932a), Lewin 
(1948a). 


(1933b). For his appreciation of Lewin, see Tolman 
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The Status of Intervening Variables 


The distinction between performance and learning, which must be 
accepted in one form or another, requires that learning be inferred from 
performance. These inferences are always being made, even in the most 
“objective” sorts of observations, because only “relevant” physical or 
physiological occurrences are recorded. There is nothing especially new 
about Tolman’s intervening variable, except that he points out clearly and 
insistently that the intervening variable is there, and is not out of place 
in a behavioral science. The logical mistake is easily made of supposing 
that the physical and physiological terms to which the intervening varia- 
bles are anchored are themselves independent of theories, which they 
are not. The kinds of experiments which are performed, the kinds of 
measurements which are taken, always involve selection by the experi-. 
menter. The data, even though reproducible, are not “pure” facts of 
nature. The question may therefore be raised whether the intervening 
variables are going to be anything not found in the experimental rela- 
tionships directly. It is doubtful whether a satisfactory answer can be 
given until the intervening variables are identified with quantities. If 
there are derived constants, interchangeable from one situation to an- 
other, then the intervening variable becomes a scientific construct of some 
importance."* 

As previously pointed out, Hull has adopted the intervening vari- 
able also, so that the logic is common to the differing points of view. 
The remaining difference lies in the sorts of intervening variables chosen. 

A distinction has been made by MacCorquodale and Meehl (1948) 
between two kinds of symbolic intermediaries: “intervening variables” 
and “hypothetical constructs.” A pure “intervening variable” is a mere 
convenience in system-making, and has no properties or dimensions other 
than those specified in the equations that define it. A “hypothetical con- 
struct” is an entity with properties attributed to it beyond those of the 
functional equations in which it appears, Hull’s re is an example of a 
hypothetical construct because it has all the properties of a muscular 
response and is said to give rise to proprioceptive stimuli (sc) as any 
muscular response does. While it is hypothesized, rather than observed, 
it is potentially observable and need not always have the status of a 
metaphor. Although Tolman’s system appears to be built around “inter- 
vening variables” in the MacCorquodale and Meehl sense (demands, 
cognitions, etc. ), Tolman denies this and asserts not only that all theories 


64 Seward (1955). 
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use both types of intermediary, but that hypothetical constructs are 
desirable.** 

The distinction made by MacCorquodale and Meehl is a useful one, 
but it is a mistake to jump to the conclusion (which they did not sug- 
gest) that one of the types of intermediary is invariably preferable to the 
other. Possibly what we need is a clearer specification of the rules under 
which each should be used. Consider the gene theory in biology. If the 
gene is believed to be “a something” that has its place on the chromo- 
some, it is a hypothetical construct, and its added properties are such as 
may be revealed eventually under the microscope or by chemical anal- 
ysis. If it were merely a pawn to be used in the study of the statistics 
of unit characters and their genetic transmission, the gene would be a 
pure intervening variable. In this case, the status of hypothetical con- 
struct has seemed to be a useful one, as one considers chromosome maps, 
crossovers, and so on. The gravitational constant (g) used in the law 


of falling bodies is another kind of intervening variable, for it is merely 


a constant defining the acceleration of a free-falling body under speci- 
fied conditions, with no other status beyond that. It is a convenience in 


harmonizing free-falling bodies, balls rolling down inclined planes, and 
the period of a pendulum, and as such justifies itself even though it has 
no independent existence as an entity. Precisely what kinds of intervening 
variables prove useful in psychology will depend upon what we can do 
with them. The test comes in their clarity, specificity, and fertility in 
system-making. If a hypothetical construct is used, its surplus meanings 
should be reasonable, and ideally a search should go on for its inde- 


pendent verification. 


What Kind of Behaviorism? 

When Tolman announced his purposive behaviorism oi, Aperioaty 
psychology was still excited over the new behaviorism of Watson. It 
was Tobear’s contribution then to show that a sophisticated behaviorism 


can be cognizant of all the richness and variety of psychological events, 
and need Pi be constrained by an effort to build an engineer’s model of 


the learni chine. ¢ r 
With the diversification of behaviorism under the influence of Tol- 
man and others, the old brittleness of Watsonian_behaviorism has largely 


disappeared, and what virtues there are in the behavioristic position 
f a 


65 Tolman (1951b), pages 282-283. 


1 pages Taze (1954). 
(195 aes ot ( 105i ye ame already ten years old when his major book ap- 


peared. 


For a further discussion of the issues, see Marx 
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have now become part of the underlying assumptions of most American 
psychologists—without most of them thinking of themselves as behavior- 
ists at all. 

Tolman’s catholicity and his friendliness to new ideas have prevented 
his developing a “tight” or “elegant” system, even though he has all along 
made suggestions as to what such a system would be like. fle has kept 
his prominent place in the forefront of learning theorists by his sensi- 
tivity to important problems, by his inventiveness in experimentation, and 
by keeping others on their mettle. 
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Chapter 7 


CLASSICAL GESTALT THEORY 


URING the first quarter of the century in America the quarrels 

within academic psychology lay chiefly inside the framework of 
association psychology. Structuralism, functionalism, and behaviorism 
were all members of the association family. A few dissident voices, such 
as Freud and McDougall, got little hearing. This complacency was dis- 
turbed by the new gestalt doctrine which influenced American learning 
theories chiefly through the appearance in English of Wolfgang Kohler’s 
Mentality of apes (1925) and Kurt Koffka’s Growth of the mind (1924), 
The theory had been developing in Germany since it was first announced 
by Max Wertheimer in 1912, but these books, and the visits of Köhler 
and Koffka to America about the time of their publication, brought the 
new theory vividly to the attention of American psychologists, 

Koffka’s book had an important effect upon American learning theory 
because of its detailed criticism of trial-and-error learning as conceived 
by Thorndike—a thrust at the very heart of the currently popular theory. 
The vigorous attack upon Thorndike (and upon behaviorism, although 
Thorndike was not, strictly speaking, a member of the school) was sup- 
ported by Kéhler’s well-known experiments on apes, described in detail 
in his book which appeared close in time to Koffka’s. Kohler’s book brought 
the notion of insightful learning into the foreground, as an alternative 
to trial and error. He showed how apes could obtain rewards without 
going through the laborious processes of stamping out incorrect responses 
and stamping in correct ones, as implied in Thorndike’s theories and as 
displayed in the learning curves of Thorndike’s cats. Apes could use 
sticks and boxes as tools; they could turn away from the end of the 
activity toward a means to the end. 

Kéhler’s experiments with apes were done in the years 1913-1917, 
on the island of Tenerife off the coast of Africa. His book about these 
experiments ' appeared in English a few years later and immediately was 


1 Köhler (1917). 
Kohler 1 222 
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widely read and quoted. Two main series of experiments interested the 
American psychological public in the problems of insight. These were 
the box problems and the stick problems. 

In the single-box situation, a lure, such as a banana, is attached to 
the top of the chimpanzee’s cage. The lure is out of reach but can be 
obtained by climbing upon and jumping from a box which is available in 
the cage. The problem is a dificult one for the chimpanzee. Only Sultan 
(Köhler’s most intelligent ape) solved it without assistance, though six 
others mastered the problem after first being helped either by having 
the box placed beneath the food or by watching others using the box. 
d by direct imitation of others. What watch- 
ing others use the box did was to lead the observer to attempt to use the 
box as a leaping platform, but sometimes without making any effort 

he lure. When the problem was mastered, a 


whatsoever to bring it near t 
chimpanzee alone in a cage with box and banana would turn away from 


the goal in order to seek the box and to move it into position. This “de- 
tour” character of insightful behavior is, according to Köhler, one of its 


important features. i 
blem, requiring that a second box be placed 


The box-stacking pro 
upon the first before the banana can be reached, is much more difficult. 


It requires both the incorporation of the second box into the pattern of 
solution, and a mastery of the gravitational problem of building a 
stable two-box structure. While the emphasis in secondary accounts of 
Kéhler’s work is usually upon the intelligence which his apes displayed, 
he himself is at pains to account for the amount of apparent stupidity. 
In the box-stacking experiment, for example, he believes that the apes 
have shown insight into the relationship of one-box-upon-another, but 
not into the nature of @ stable two-box structure. Such physical stability 
as was achieved in later structures was essentially a matter of trial and 
error, ; 

The stick problems required the use of one or more sticks as tools 
with which to rake in food out of reach beyond the bars of the cage. 
The beginning of insight occurs as the stick is brought into play, al- 
though often u successfully, as when it is thrown at the banana and lost. 
Once it has been used successfully, it p sought as the chiipauzee 
and used promptly. The most dramatic of the stic ug experiments 
was in a problem mastered by Sultan, in whieh ewe y es sticks 
were joined together after the manner of a jointed fis hing pole in order 
to obtain a banana which could not be reached with either stick alone. 
The process was a slow one, and the first placing of the sticks together 


The problem was not solve 
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appeared to be more or less accidental. Once having seen the sticks in 
this relationship, however, Sultan was able to “get the idea” and to 
repeat the insertion of one stick into the end of the other over and over 
again. 

Although the attack by Kohler and Koffka was chiefly upon Thorndike, 
it came at a time when American psychology was in the grips of a confident 
but somewhat sterile behaviorism. It is hard to see at this distance why such 
a common-sense and familiar notion as insight in learning should have 
created such a stir. But at the time Watsonian behaviorism had, in fact, 
won support for a fairly “hard-boiled” view of learning, according to 
which the organism was played upon by the pushes and pulls of the en- 
vironment and reacted in ways essentially stupid. Lloyd Morgan’s canon 
which had seriously undercut the attributing of higher mental processes 
to animals had fairly well succeeded via behaviorism in excising them 
from man also. Therefore the return to a more balanced view, represented 
by the insight experiments, gave new hope to teachers and others who 
saw thinking and understanding returned to respectability. Insight was 
not a new discovery—it was a return to a conception laymen had never 
abandoned. Nobody uninfluenced by peculiar doctrines would ever have 
denied insight as a fact—yet it took Köhler to restore it as a fact in 
American psychology. It was, in some respects, time for a change, and 
Kéhler’s experiments dramatized release from the negatives of Thorn- 
dikian and Watsonian thinking. 

That the more enthusiastic reception for the new learning theories 
should have come first from the educators is not surprising.” There had 
already been a rift growing between Thorndike and the more progressive 
group within education, who, under Dewey’s leadership, had made much 
more than he of the capacity of the individual for setting and solving his 
own problems. The new insight doctrine fitted nicely their slogan of 
freeing intelligence for creative activity. 

Animal psychologists like Yerkes, who had never espoused behavior- 
ism, welcomed the new movement as a natural development. Yerkes him- 
self had done experiments on insightful learning independent of gestalt 
influences,’ and the intelligence demonstrated by Kéhler’s apes did not 
surprise him. Curiously enough, insightful learning in subhuman ani- 
mals was less threatening to theorists than learning by understanding in 
man, chiefly because it was still the rather rare and unusual behavior 
among animals. Rats still learned mazes, it was thought, without in- 


2 It was an educator-psychologist, R. M. Ogden, who translated Koffka (1924), 
3 E.g., Yerkes (19167 
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sight. So the animal experimenters added insight experiments to their 
list and continued both old and new experiments. But if the insight 
doctrine were to be accepted in human learning, the field would be wide 
open for destroying all the familiar laws of learning as they applied to 
man. It is not surprising that those who were at the time concerned more 
largely with human learning, such as Thorndike, Robinson, and Guthrie, 
should all have been cool to the insight concept. 

The visible opposition between Köhler and Thorndike was over 
insight and trial and error, that is, over intelligent learning as con- 
trasted with blind fumbling. But the opposition between gestalt psy- 
chology and association psychology goes much deeper. In order to under- 
stand this opposition, it will be necessary to examine the gestalt views 
in greater detail. 

There are a number of variants within the gestalt movement and 
among those strongly influenced by gestalt conceptions. Kohler and 
Koffka were closest to Wertheimer, the official founder of the school. 
voted to their treatment of learning. Lewin, while 
n and definitely within the ranks, broke enough new 
ground so that a separate chapter is devoted to his position. All four 
of these men, originally German, eventually settled in America, where 
three of them (Koffka, Lewin, and Wertheimer) have since died. They 
are the Jeaders of what is historically gestalt psychology. 

Their views were picked up and considerably modified by Wheeler 


(1929) (1932) (1940) under the name of organismic psychology, but 
ch from classical gestalt psychology for 


This chapter is de 
originally from Berli 


his position deviated too mu 
him to be counted within the core group.* 

The fullest and most systematic treatment of the problems of learn- 
ing from the gestalt viewpoint is found in Koffka’s Principles of gestalt 
psychology (1935). It was written after a period of acclimatization to 
America, and so meshes somewhat better than earlier writings with the 
concerns of American psychologists. Most of the direct references will be 


made to this source. 
orated, with Perkins, in a widely used textbook on edu- 
and Perkins (1932). For an account of the organismic 


sonun. Se ‘tion of this book, Hilgard (1948), pages 234-260. Because 
e a influential, the chapter has been dropped from this 


edition, despite some provocative ideas contained in the theory. 


4 Wheeler’s views were elabi 
cational psychology, Wheeler 
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THE LAWS OF ORGANIZATION 


Gestalt psychology had its start and has achieved its greatest success 
in the field of perception. Its demonstrations of the role of background 
and organization upon phenomenally perceived processes are so convinc- 
ing that only an unusually stubborn opponent will discredit the achieve- 
ment. The primary attack upon association theory was an attack on the 
“bundle hypothesis” sensation theory—the theory that a percept is made 
up of sensation-like elements, bound together by association. 

When the gestalt psychologists turned later to the problems of 
learning, the equipment brought to the study of learning was that which 
had succeeded in the field of perception, and the arguments previously 
used against the sensation were turned against the reflex. In spite of the 
attention which Kohler’s ape experiments received, gestalt psychologists 
can be fairly said to have been only moderately interested in learning. 
This does not mean that their few experiments are without significance; it 
means only that they have considered the problems of learning secondary 
to the problems of perception. Perhaps in America the shoe is on the 
other foot, and in preoccupation with learning we have too long neglected 
the relationship between the two fields.* 

The starting point for Koffka’s treatment of learning is the assump- 
tion that the laws of organization in perception are applicable to learn- 
ing. This applicability is enhanced because of the prominence in learn- 
ing given in his theory to the initial adjustment, to the discovery of the 
correct response in the first place. Since this discovery depends upon 
the structuring of the field as it is open to the observation of the learner, 
the ease or difficulty of the problem is largely a matter of perception. In 
some sense, Kéhler’s apes were presented with perceptual problems; if 
they literally “saw” the situation correctly, they had “insight.” 

The application of the laws of organization to learning problems is 
done too casually by Koffka to be very convincing, but if he had been 
more systematic about it, the development of the argument might have 
gone along the following lines. There would be a guiding principle (the 
Law of Priignanz) and four laws of organization subordinate to it: the 
laws of similarity, proximity, closure, and good continuation. 


5 The point was made some years ago by Leeper (1935b). 


Lo 
to 
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The Law of Prägnanz 

The law of Prignanz è suggests the direction of events, Psychological 
organization tends to move in one general direction rather than in other 
directions, always toward the state of Prégnanz, toward the “good” 
gestalt. The organization will be as “good” as prevailing conditions 
allow. A “good” gestalt has such properties as regularity, simplicity, 
stability, and so on. 

Because of the dynamic properties of “fields,” the conditions of 
equilibrium are necessarily important. In physics, processes which 
terminate in stationary distributions are characterized by certain maxima 
and minima, as by a minimum of energy capable of doing work. This 
minimum for the whole sometimes requires a part to absorb a maximum 
of energy. The law of Prégnanz is a law of equilibrium like the princi- 
ples of the maximum or minimum in physics. It ought to correspond to 
them in reality, since it is the phenomenal representation of physiological 
processes which obey physical laws. In effect, however, it is used as an 
analogy. When organization moves toward a minimum, it is character- 
ized phenomenally by the simplicity of uniformity; when it moves toward 
a maximum, it is characterized by the simplicity of perfect articulation." 
“We might say, sacrificing a great deal of the precision of the physical 
Proposition, that in psychological organization either as much or as 
little will happen as the prevailing conditions permit.” 8 

Learning situations are problematical. They therefore give rise to 
tensions and to disequilibria. Some such principle as the law of Präg- 
nanz becomes appropriate to them, although Koffka does not develop the 
Point, except by way of the other laws, each of which, in its own way, 
is an illustration of the more general principle. 

l. The law of similarity. The law of similarity or of equality is the 
Counterpart of association’s law of similarity. This and the other laws 
all derive from Wertheimer (1923). He used it as a principle determin- 
ing the formation of groups in perception, such as groups of lines or 
dots, Similar items (e.g, alike in form or color) or similar transitions 
(e.g. alike in the steps separating them) tend to form groups in per- 
ception. In a series of experiments with nonsense syllables, two-place 
numbers, and nonsense two-dimensional figures, Köhler (1941) showed 
quite conclusively that similar (homogeneous) pairs were much more 
ately translated as “pregnancy.” It has the meaning 


è The German word is inadequa as 
of “knapp, und doch vielsagend” (compact but significant). 


7 Koffka (1935), pages 171-174. 
8 Koffka (1935), page 108. 
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readily learned than dissimilar (heterogeneous) ones. a 
pairs illustrate the law of similarity. He denied that his results supporte 
simple association theory, preferring instead to attribute the results to 
“interaction” producing a unitary trace rather than a “connection” be- 
tween similar items. 

The law of similarity was applied by Koffka to the selection of a 
memory trace by a process active at the time of recall. That trace will be 
selected by an excitatory process which possesses the same wholeness 
character. The meaning of trace selection will be considered later. The 
meaning is conveyed sufficiently by the process of recognition, where a 
face present now recalls the same one seen earlier and results in the 
feeling of familiarity. 

2. The law of proximity. Perceptual groups are favored according to 
the nearness of the parts. Thus if several parallel lines are spaced un- 
evenly on a page, those nearer together will tend to form groups against 
a background of empty space. Because whatever favors org 
will also favor learning, retention, and recall, the law of 
comes the gestalt equivalent of association by contiguity, Patterning 
through proximity holds also within audition, 


as in the grouping of 
successive clicks. Then the proximity is a temporal one. As it applies to 
memory, the law of proximity becomes also a law of recency. Old im- 


pressions are less well recognized and recalled than new ones because 
the recent trace is nearer in time to the present active process, 

3. The law of closure. Closed areas are more stable than unclosed 
ones and therefore more readily form figures in perception. As applied 
to learning, closure is an alternative to the law of effect. The direction of 


behavior is toward an end-situation which brings closure with it. It is 
in this manner that rewards influence learning. 


anization 


proximity be- 


So long as activity is incomplete, ever 
to the animal a transitional situation; wher 


y new situation created by it is still 
goal, he has arrived 


eas when the animal has attained his 
at a situation which is to him an end-situation,20 


In a problematic situation the whole is seen as incomplete and a 


tension is set up toward completion. This strain to complete is an aid to 
learning, and to achieve closure is satisfying, This is the m 


eaning of 
the above quotation, and shows how closure is an alternative to effect, 
4. The law of good continuation. This is the last of Wertheimer’s 


principles taken over by Koffka, although Wertheimer had several more. 
* Koffka (1935), page 464. 


1° Koffka (1924), page 102. From The growth o the mind b k: . 
Paul, Trench, Trubner & Co., Ltd., London, 1994. f y K. Koffka, Kegan 
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Organization in perception tends to occur in such a manner that a 
straight line appears to continue as a straight line, a part circle as a circle, 
and so on, even though many other kinds of perceptual structuring would 
be possible. Closure and continuation are aspects of articulate organiza- 
tion. Oragnization applies to learning as well as to perception. 

Köhler (1941) demonstrated, for example, that the learning of 
paired figures was facilitated when the figures “fitted,” that is, when the 
cue and response items formed a regular pair. Pairs fitting less well were 
harder to learn. 

In each of the foregoing laws we have a principle from perception 
applied to learning. 


THE SPECIAL PROBLEMS OF LEARNING 


The general point of view of gestalt psychology is expressed in the 
statement that the laws of organization apply equally to perception and 
to learning, There are, however, special problems within learning to 
which Koffka devoted considerable discussion. Because of his anti- 
empiricist ™ position, he had to find some way of dealing with the evi- 
dent influence of earlier experiences on present performance. The prob- 
lem is best approached via memory, in which the past is represented 
somehow in the present. A second problem concerns the gradual trans- 


formation which takes place as skills of the trial-and-error sort are mas- 


tered. Finally, of course, there is the problem of restructuring the present 
field, as implied in insightful learning and in productive thinking. 


The Role of Past Experience: the Trace Theory 


Because modification by and through experience is part of the very 
definition of learning, the gestalt attitude toward experience is important. 

The gestalt preference is distinctly for conceiving psychological proc- 
esses as the function of the present field, and the explanatory role of past ex- 
perience is denied in situation after situation in which to others it seems to 
be important. Examples include the perceptual constancies—whereby a 
e, a red coat looks equally red in sunshine 


man looks man-size at a distanc ; : : : 
and shadow. The illusions of movement and perception of third dimension 


are also included. Koffka, in spite of a vigorous objection to empiricism, 
takes a moderate view toward the role of past experience in learning. 


“uK, 3 iricist, objecting to an explanation of present perception solely 
on ‘ee ota ek This did not make him a nativist, however, for he 
believed that pealk laws of organization provided a third solution to the problem of 
space perception. See Koffka (1935), pages 160-161; Luchins (1951), pages 83-86. 
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i i that an anti-empiristic attitude does not 
ede CREEL GRC epost lan ok experience. Not that it makes use of ex- 
perience causes our objection to empiricism, but how it makes use of it.!2 

A favorite experiment repeatedly cited by gestalt psychologists in 
order to disprove the role of experience is that of Gottschaldt (1926). 
For example, if a picture of a letter E is presented 1000 times, and then 
a church window is exposed, are you any more likely to notice that 
some of the leaded lines in the windows could form a letter E than if 
E had been presented only once? We may doubt if a jury of association 
psychologists or anyone else would expect experience to tear down a 
percept into the thousand and one possible parts unless there were some 


kind of search involved. If you looked for the hidden part, and found 
it, the finding would be easier the next time, Cottsch 


aldt accepts this 
conclusion, 


and had evidence that the results of previous discovery were 
evident in later tests. In the newspaper puzzles with faces hidden in the 
trees, once the face has been found it is more easily found again. Gott- 
schaldt’s experiment reduces to the demonstration that camouflage hides 
familiar objects as effectively as it hides unfamiliar ones—provided we 
have no reason to be looking for the familiar objects. Gottschaldt’s experi- 
ments are cited by gestalt psychologists as very damaging to association- 
ist or empiricist positions. Because of the difference in conception as to 
what is important, the experiment has not impressed association psycholo- 
gists as being a crucial refutation of their position." 
By the very nature of the case, it is not 
chologist to dismiss the role of experience in memory as it is to dismiss 
its role in perception. Memory so obviously depends upon prior experi- 
ence that it would be foolhardy to deny it. Koffka is as puzzled as a non- 
gestalt psychologist over Wheeler’s attempt to get rid of memory traces," 
Koffka believes some trace theory essential and proceeds to consider how 
the traces of past experiences can be reactivated by present processes, 
The trace hypothesis is an involved o 


ne, and its full exposition re- 
quires over 100 pages of text.! The essential features of the theory are 


(1) a trace is assumed which persists from a prior experience, so that it 
represents the past in the present; (2) a Present process is also posited, 
one which can select, reactivate, or in some manner communicate with 
the trace; and (3) there is a resulting new process of recall or recogni- 


12 Koffka (1935), page 639n. 
é ™ Moore (1930) Braly (1933) and Henle (19422) dispute Gottschaldt’s in 
ions. 


14 E.g., Wheeler (1932), pages 167-169, 
15 Koffka (1935), pages S598. 


as easy for the gestalt psy- 


terpreta- 
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tion. The process and the trace are to be distinguished; they are localized 
in different parts of the brain. The trace system is organized according to 
the same laws applying in other fields, and the communication between 
process and trace follows these laws. 

The trace concept was further elaborated by Köhler (1938) and by 
Katona (1940). Katona made a distinction between individual traces, 
referring to specific items, and structural traces, derived from the whole- 
ness character of a process. The structural traces are said to be more 
adaptable and flexible, to be formed more quickly, and to persist longer 
than individual traces. 

By way of the doctrine of traces, the gestalt psychologist is able to 
represent a past event in the present. That is, of course, all that the as- 
sociation psychologist proposes to do. But the trace system, if it is a 
system organized under gestalt laws, must undergo changes according to 


the law of Prägnanz. If these changes are of a systematic sort it will be 


evidence against a theory of mere connections weakening in time or in- 


hibited by new learning. 

The experiments of Wulf (1922) and later experiments following up 
his suggestions will be reviewed among the illustrative experiments. The 
main point is that perceived figures are reproduced differently from the 
original model, and that the differences are systematic and progressive 
rather than random. The changes with successive reproduction cor- 
respond to the laws of organization, and move toward the “good” gestalt. 
A circle with a small break in it tends to be reproduced as a whole circle, 
and an asymmetrical figure tends to become more symmetrical in a draw- 

wo chief tendencies noted by Wulf: 


ing of it from memory. There are t 
leveling and sharpening. The leveling tendency is that already described, 


a tendency to move according to the intrinsic character of the figure into 
symmetry and uniform relations of parts. Sharpening consists in the ac- 
Centuating of details which serve as the discriminatory features of the 
pattern, For example, a saw-toothed figure may be reproduced with 
deeper and more striking teeth. Against the theory that memory leads 
to decay and fuzziness, the gestalt theory is that it leads to change but 
in the direction of greater clarity. A third tendency pointed out by Wulf 
is called normalizing. A figure which looks something like a familiar ob- 
ject tends on reproduction to be drawn more like such an object. All 
these changes (leveling, sharpening, and normalizing ) are in the direction 


of a “good” gestalt. 
To the extent th 


16 Katona (1940), pages 194-195. 


at these systematic changes occur in the trace, there 


232 THEORIES OF LEARNING 


is a real addition which gestalt theory makes to other theories of memorial 
change. 


New Learning: the Formation of Traces 


What happens as new traces get formed? It is to be recalled that 
Koffka distinguishes between the process and the trace. The process is 
that which goes on because of the present stimulating situation; the trace 
is the result of earlier processes. 

1. Some processes are directly dependent upon stimuli. When such 
stimuli are presented a second time, the processes differ from those pres- 
ent the first time because the stimuli have been reacted to before, For 
example, the second exposure may be recognized as “familiar.” This dif- 
ference suffices to show that learning took place with the first exposure. 
His illustrations are limited to perceptual ones, and it is apparently 
perceptual responses to which Koffka refers when he speaks of processes 
directly dependent upon stimuli." 

2. Processes may undergo transformation within a single sustained 
presentation. For example, when a series of sentences about mathematics 
is finally “understood” as a demonstration or proof, such a transforma- 
tion has occurred. The insight experiments also illustrate such transforma- 
tions.’ 

3. Some processes are transformed by their consequences. This 
amounts to an acceptance by Koffka of the empirical “law of effect,” but 
the explanation differs from Thorndike’s. The transformation of process 
is at base the same as in the insight experiments, but it often occurs piece- 
meal as a consequence of the experimental arrangements. In the insight 
experiments all the data necessary for the transformation of process are 
present simultaneously, so that restructuring can take place at once. In 
the typical trial-and-error experiment, by contrast, the situation cannot 
be understood until the animal’s activity has itself led to consequences— 
to food, to freedom, and so on. Once success is achieved, the process lead- 
ing to success is transformed. It has a new meaning, a new role in the 
goal-directed activity.® 


The Effects of Repetition: the Consolidation of Trace 
Systems 


The aggregate trace system resulting from repetition is always being 
transformed. With each repetition the trace organization left from pre- 


1 Koffka (1935), pages 549-550. 
18 Koffka (1935), pages 555-556, 
19 Koffka (1935), page 552. 
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ceding processes interacts with the present process to create something 
new. According to the principle of retroactive inhibition (which Koffka 
accepts), preceding individual traces are disrupted by the new learning. 
Repetition can still be beneficial, however, because the trace system be- 
comes consolidated even while individual traces are destroyed. As the 
trace system becomes more fixed, it becomes pre-eminent over process 
and exerts more influence on future processes than such processes affect 
it. Such a trace system is said to become increasingly available; that is, it 
corresponds to what associationists think of as a habit system ready to 
function. A precaution is needed in the interpretation of availability, 
for conditions which make a trace more and more available for mere 
repetitions of one process may make it less available for other processes.”° 
This is one of the dangers of too much drill in the school subjects, because 
drill may have a narrowing Or “blinding” influence.*! 

The treatment of the acquisition of skill by Koffka is sketchy and 
conjectural, for the problem has not been experimentally attacked by 
members of the gestalt group. But skill is made coherent with the process- 
trace theory through a line of thought somewhat as follows. The trace, as 
part of the field of a process, exerts an influence on the process in the 
direction of making it similar to the process which originally produced 
the trace.?? This is close to Guthrie’s statement that we tend to do what 


we last did in the same situation. Highly perfected skills can be repeated 
he process communicates with a stable 


after periods of disuse because th 
trace system to which it then conforms. While the skill is being learned 


the trace is less stable. Through the interaction of trace and process, 
greater stability is achieved. This achievement of greater stability is 
what is meant by improvement in the skill. Because the trace system, 
obeying dynamic laws, also undergoes stabilizing changes over a period 
of no practice, the greater improvement with distributed practice than 


with massed practice is explained. 


Restructuring the Present Field: Insight 
The contrast between trial and error and insight is a subject of some 
misunderstanding because there are empirical facts on the one hand and 
ts on the other. So far as empirical situations 


theories about these fac i 
e are experime 


are concerned, ther 
fumbling, with gradual improvemen ar 
improvement takes place- These may 

20 Koffka (1935), page 547. 


as Luchins (1942). 
2° Koffka (1935), page 553- 


nts which demonstrate a maximum of 
ts and little understanding of how 
lassified as experiments in 
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which learning is by trial and error, without prejudging the processes 
to be invoked in explaining the learning. There are also experiments 
in which the learner obviously perceives a relationship which leads to 
a problem solution, and the experiment may be classified as an insight 
experiment. And there are situations which fall between, where there is 
partial insight combined with rather blind trial and error, The empirical 
grading of situations does not mean that the interpretations have to be 
so graded, that is, that the trial-and-error behavior must be explained 
by a trial-and-error theory, insight by an insight theory, and mixed be- 
havior by appropriate mixtures of the theories. This impression is oc- 
casionally given by writers taking a middle-of-the-road position. The 


theorist, recognizing these empirical differences, tries to account for the 
differences on the basis of a unified set of principles. 


Descriptive Characteristics of Insightful Learning 

We may distinguish between what 
investigating insightful learnin 
for what goes on is by no mea 
with other forms of learning. 

1, A more intelligent organism is more likel 
as it is more likely to be successful at other fo 
Thus older children are more successful at insi 
ones (e.g. Richardson, 1932), and 
pigs. 


goes on in an actual experiment 
g and the distinctive criteria of insight, 
ns limited to insight, and overlaps greatly 
y to achieve insight, just 
rms of complex learning, 


ght problems than younger 
apes more successful than guinea 


2. An experienced organism is more likely to achieve insightful solu- 
tion than a less experienced one (e.g., Birch, 1945b). To some extent in- 
sight depends on past experience, as other forms of learning do. Thus a 
child cannot get insight into a mathematics problem stated symbolically 


unless he understands the conventional signs, even if the problem is other- 
wise suited to his capacity, 


The difference between associatio 
in the implication of association th 
necessary past experience somehow 
gestalt theorists would agree th 
they object to explan 
taking organization in 


lution, 
without 


23 The issue is discussed agai 


theories of Harlow (1949), 
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lem. In one of his early experiments, Maier (1930) provided his subjects 
with all the experience necessary for solving a problem, but only one of 
thirty-seven solved it. The past experience had to be used appropriately 
before solution would occur. : 

3. Some experimental arrangements are more favorable than others 
for the elicitation of insightful solution. Organization is contributed both 
by processes inherent in the organism and by structural patterning in the 
environment, and, of course, basically through the interaction of organism 
and environment. Insight is possible only if the learning situation is so 
arranged that all necessary aspects are open to observation. If a needed 
tool is hidden, its use in solution is made unlikely, or at least more diffi- 
cult. In one form of the puzzle-box it is necessary for the rat to dig 
through a sawdust floor to discover a concealed tunnel which permits 
exit. Because the entrance to the tunnel is concealed beneath a uniform 
bed of sawdust, insight is impossible, and the first solution necessarily 
occurs almost by chance—being aided only by the fact that sawdust- 
digging is within the rat’s habitual action pattern. The parts which need 
to be brought into relationship for solution are assembled more easily if 
they are simultaneously present in perception; for example, it is harder 
for an ape to learn to use a stick which lies on the side of the cage op- 
posite the food than one which lies on the same side as the food ( Jack- 
son, 1942), 

Skilled teachers are well aware of differences between situations in 
which understanding is arrived at easily and those in which it is achieved 
with difficulty—even though the same ultimate steps are involved and 
the same end stage reached. In the favored arrangement the problem is 
So structured that significant features are perceived in proper relation- 
ship, and distracting or confusing features are subordinated. Some mathe- 
matics teachers make problem solution difficult to grasp because they go 
through derivations step by step without an overview of where the steps 
are leading or what the articulating principles are, They teach the neces- 
sary operations, but the final insight eludes the students because of the 
manner in which the proof is arranged. AG 

4. Trial-and-error behavior is present in the course of achieving 
insightful solution. In the presolution period the learner may make many 
false starts and be engaged in activity which can be characterized as 
trial and error. When insight will come (if it does come) is not pre- 
dictable. These two features (initial fumbling and lack of predictability ) 
have been used by opponents of insight either to assimilate it to associa- 
tive learning because trial and error occurs, or to characterize it as 
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mystical, non-scientific, or accidental because the moment of solution 
cannot be predicted. ; ; 

The reply to those who find trial and error in insight experiments, 
and therefore wish to make insightful solution continuous with ordinary 
associative learning, is that fumbling in problem-solving is not mere trial 
and error. Even those who tend to favor trial-and-error interpretations 
have come to speak in terms of approximation and correction * or in 
other ways to indicate that the “try” is a real try and not just any old 
action in the behavior repertory. In the case of adult insight experiments, 
the “try” is often a plausible hypothesis which has to be rejected. A suc- 


cession of such hypotheses may be tried before the appropriate one is hit 
upon. The more intelligent reasoner may actu 


given problem because he commands a great 
bring to its solution. There is a theoretical 
made between blind fumbling and intelligent searching, Merely varied 
behavior is one thing; behavior testing hypotheses is varied also, but ac- 
cording to a different type of organization. 

That random behavior and luck ma: 


y further solution is illustrated by 
some behavior which I observed one summer while assisting Yerkes in 


an insight experiment with a young chimpanzee. The problem set the 
animal was to obtain a banana from a long hollow box, open at both 
ends.” The box, essentially a rectangular tube, was firmly fastened to 
the floor of a large cage. The banana was inserted through a trap door 
in the middle of the box under the watchful eye of the animal, then 
the trap door was securely padlocked. The chimpanzee, after a number 
of unsuccessful efforts to obtain the banana by direct attack—ze 
in either end of the tube with hands and with feet, attempting to lift 
the tube from the floor—seemed to give up temporarily, or, as gestalt 
psychologists say, to “go out of the field.” This extraneous behavior took 
the form of playful cavorting. In this mood the animal incorporated into 
her play the hoe handle which was standing in the corner of 
climbing it, and throwing it. Once the handle fell with its end near the 
open tunnel. The chimpanzee stopped her play, became calm, looked 
reflective, and, for the first time in her history used the pole as a tool to 
push the banana out of the far end of the tube. 

The lucky position of the hoe handle structured the situation per- 
ceptually to make solution easier. It brought the hoe handle in 
sibility and gave direction to the problem-solvi 


ally take longer to solve a 
er variety of hypotheses to 
distinction which ought to be 


aching 


the room, 


as a pos- 
ng behavior. It did not 
24 Dodge (1931). 

25 The box is illustrated in Yerkes (1943). 
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add to the chimpanzee’s past experience, but it made it easier to as- 
semble the experiences appropriate to solution. Out of what was super- 
fluous activity there thus developed a “hint” as to the direction of solu- 
tion. An illustration of the way in which direct hints may aid solution 
is provided by Maiers (1930) experiment previously referred to. By 
giving a few “hints,” in addition to the necessary past experiences, solu- 
tions were obtained to the same problem by a much larger fraction of 
his subjects. 

The objection to insight that it is unpredictable and therefore out- 
side of science is lacking in force. The moment of insight is not the im- 
portant feature in any case. Other features, such as reproducibility of 
the behavior and applicability to new situations are more important. But 
even though the moment of insight for a given animal confronted with 
a given problem is not predictable, it is possible to arrange problems in 
an order of difficulty so that the degree of probability that insight will 
occur is predictable. To assume that all predictions based on past occur- 
rences (empirical probabilities ) imply associative learning *° is to make 
of association a term so broad as to be meaningless. 

The four characteristics described (effects of capacity, prior experi- 
ence, experimental arrangements, and trial and error) are not distinctive 
for insight, although in the course of discussion certain suggestions were 
made of differences in interpretation when the solution is viewed in 


stimulus-response-association terms and when it is viewed as the achieve- 


ment of insight. 


Distinctive Criteria of Insight 

Because of the overlap of what happens in trial and error and in in- 
Sightful learning, can we specify a few defining criteria by which insight 
can be clearly differentiated from other kinds of problem solution? 

An early list of evidences of insight was provided by Yerkes (1927), 
based upon his analysis of photographic records of the problem-solving 
of a young gorilla, following upon earlier experiences with the orang- 


utan and chimpanzee. 
d with insight or understanding of rela- 
tions of means to ends, we are familiar with certain characteristics which are im- 
portant, if not differential. The following is a partial list of features of such 
behavior, It is presented here with the thought that the comparative study of 
behavior with insight, in different organisms, may reveal common characteristics. 
(1) Survey. inspection, or persistent examination of problematic situa- 
26 The assumption is made by Guthrie (1935), page 193, and by Guthrie and 

Horton (1946), page 42. 


In acts which by us are performe 
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tion. (2) Hesitation, pause, attitude of concentrated attention. (3) Trial of more 
or less adequate mode of response. (4) In case initial mode of response proves 
inadequate, trial of some other mode of response, the transition from the one 
method to the other being sharp and often sudden. (5) Persistent or frequently 
recurrent attention to the objective or goal and motivation thereby. (6) Appear- 
ance of critical point at which the organism suddenly, directly, and definitely 
performs the required adaptive act. ( 7) Ready repetition of adaptive reasponse 
after once performed. (8) Notable ability to discover and attend to the essential 


aspect or relation in the problematic situation and to neglect, relatively, varia- 
tions in non-essentials.27 


On the basis of his observations of the young gorilla Congo, Yerkes 
went on to say that these observations confirmed his suspicion “that the 
conventional formula for habit-formation is incom 
of ‘trial and error’ wholly inadequate 
tations.” 


plete, and the process 
as an account of anthropoid adap- 


The three types of evidence for insight most convincing to the ex- 
perimenter seem to me to be three: First, the interruption of movement 
for a period, referred to by Yerkes as one of survey, 
followed by the critical solution. This combines Ye 
(5) and (6). In the chimpanzee experiment descr 
was initiated following the “lucky” fall of the hoe handle near to the open- 
ing of the box. Second, the ready repetition of the solution after a single 
critical solution. This is Yerkes’ point (7). The ape in the illustration 
given was returned to the experimental room on the followir 


ng day. Every- 
thing was arranged as before. When the banana was locked into position 
and the chimpanzee released, there was 


a single flip of the lock (it might 
have been unfastened!) and then the animal went directly for the hoe 
- handle, carried it over a shoulder in a 


before, and proceeded to use it app. 
byplay, no dropping it on the floor. 


inspection, attention, 
rkes’ points (1), (2), 
ibed above, this stage 


, a kind of insight experi- 
ment that Yerkes invented. Here insight is indicated by a perceived re- 


lationship such as “the middle one of the open doors” “the second from the 


27 2 7: 5 F itari ai 
a (1927), page 156. For other criteria of insight, see Pechstein and Brown 


(1989) 
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right of the open doors.” In human subjects these generalizations are 
often put into words when the problem is solved. 

These three types of evidence are not entirely clear-cut and un- 
ambiguous, but they serve reasonably well when the whole context of 
the problem is considered. A rat pauses, too, before making a jump in 
the Lashley discrimination apparatus. How, one may ask, do you dis- 
tinguish this pause from the kind Yerkes speaks of? The answer lies 
partly in the greater freedom of Yerkes’ animals to move about and to do 
other things. Interrupted behavior is more readily observed when the 
possibilities of varied behavior are rich. So too the repetition of a first 
solution can be made without insight, as shown by the Guthrie and Hor- 
ton cats. However, when the release pole was moved a few inches they 
were unable to use their prior pole-pushing habits, thus failing to live up 
to the third kind of evidence for insight.” But a chimpanzee or gorilla 
that has learned to obtain a banana with a stick will search for a stick 
when a banana is out of reach, or will improvise a stick substitute out of 
bundled straw. Having reacted to the more abstract relationship of 
stick-as-a-tool-to-obtain-banana, it is not disturbed by a slight change in 
the environment. 
ations of a perceived relationship to another situation in 
which it is applicable is the equivalent of transfer of training. The gestalt 
writers prefer to speak of it as transposition, on the pattern of a trans- 
posed melody. What is transferred is a relationship OF generalization, 
although the contents in the two situations may be entirely changed. 

Stimulus-response psychologists recognize a corresponding application 
of old learning to new problems through stimulus generalization. While 
there is some overlap between the theories at all points (hesitation, 
repeated solution, transposed solution ) there is no denying that, descrip- 
tively, behavior characterized as insightful occurs. The residual differ- 
ences between the two theories lie more in their theoretical interpretations 
of what happens than in the descriptive overlap between the simpler 
forms of trial-and-error learning and the more convincing illustrations of 
insight. At the level of observation, insight is not the explanation of 
problem solution for a gestalt psychologist any more than for a stimulus- 


response psychologist. F 
Köhler yes about the misinterpretations to which his book on 


apes led: 


Such applic 


ression [insight] in a description of the intelligent 


When I once used this exp’ : 
ehavior of apes, an unfortunate misunderstanding was, it seems, not entirely 
5 


28 Guthrie and Horton (1946), page 17. 


240 THEORIES OF LEARNING 


i the animals were found to be capable of achievements 
‘i i ee a ee to occur below the human level. It was then stated 
i eli accomplishments clearly involved insight. Apparently, some readers 
interpreted this formulation as though it referred to a mysterious agent or 
faculty which was made responsible for the apes’ behavior. Actually, nothing of 
this sort was intended when I wrote my report... . No question of inventions 
or other outstanding intellectual achievements is here involved, and, far from 
referring to a mental faculty, the concept is used in a strictly descriptive fashion.?® 


To the gestalt psychologist, insight exemplifies rather more clearly 
than other forms of learning the applicability of the laws of organization. 
It is these laws that explain insight, and it is the gestalt contention that 
the same laws explain other forms of learning. Only in that sense is in- 
sight for them the typical or characteristic kind of learning, 


Productive Thinking 


Wertheimer had lectured on thought processes for many years, but 
had published only a few fragmentary papers during his lifetime. He 
had, however, completed the manuscript of a small book just before his 
death. This was edited by his friends and appeared under the title Pro- 
ductive thinking (1945). In it a number of his experimental studies are 
summarized in his characteristic way, with penetrating qualitative analy- 
sis of simple situations serving to illustrate the differences between his 
approach and other approaches to which he is objecting, 

The two chief competing alternatives to adopting the gestalt ap- 
proach to thinking and problem-solving are said to be formal logic on 


the one hand and association theory on the other. Both of these alterna- 
tives are believed to be too limited to e 


when an individual confronted with a 


out between a blind solution in 
a sensible solution in which 


is often an unsuccessful applica- 
o be inappropriate, Experi- 
ments are cited, for example, in which school children are taught to 
find the area of a parallelogram by dropping lines from two corners 
perpendicular to the base, thus converting the figure to a rectangle, 
whose area can be found. Children who could do the examples per- 
fectly were baffled, however, when a parallelogram was presented in a 
new orientation, so that the “correct” steps of the procedure led to con- 
fusing results, They had learned the solution according to a blind pro- 


29 Köhler (1947), pages 341-342. 
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cedure. By contrast, the solution of a five-and-one-half-year-old child is 
reported. 


Given the parallelogram problem, after she had been shown briefly how to 
get at the area of the rectangle, she said, “I certainly don’t know how to do that.” 
Then after a moment of silence: “This is no good here,” pointing to the region at 
the left end; “and no good here,” pointing to the region at the right. 
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FIGURE 30. Troublesome parts in child’s attempt to apply rectangle theory to 
parallelogram. After Wertheimer (1945), page 48. 


“It’s troublesome, here and there.” Hesitatingly she said: “I could make it 
right here . . . but... .” Suddenly she cried out, “May I have a scissors? 
What is bad there is just what is needed here. It fits.” She took the scissors, cut 


vertically, and placed the left end at the right.*° 


| A B 
FIGURE 31. Child’s solution of parallelogram problem with a scissors. After 
Wertheimer (1945), page 48. 


Another child, given a long parallelogram cut out of a piece of 
paper, remarked early that the whole middle was all right, but the ends—. 


She suddenly took the paper and made it into a ring. She saw that it was 


all right now, since it could be cut vertically anywhere and made into 


a rectangle. ; 
s these the solutions appear in an orderly way, in 


In cases such a an 
ation. It is this structural ap- 


line with the true “structure” of the situ 


proach which Wertheimer emphasizes. . 
Children readily grasp such “structural” solutions unless they are 


badly taught in an atmosphere of blind repetitive drill. Given figures 
such as those on the left in Figure 82 and those on the right, they can 
easily sort out the unsolvable ones from the solvable ones. It is futile to 


30 Wertheimer (1945), page 48. 


THEORIES OF LEARNING 
242 
e, says Wertheimer, that these distinctions are made on the basis of 
ere as the associationist seems to believe. Children make the dis- 
tinctions because. they know the essential nature of the solution. The 
structural features and requirements of the situation itself set up strains 
and stresses which lead in the direction of improving the situation, that 
is to say, to solving the problem. 


A-Figures B-Figures 


FIGURE 32, 
It is possible to change 
the A-figures 


y of school children to solve 
the fami iarity of the figures, 


-figures is said to depend on something other than 
From Wertheimer (1945), page 20, 


The implications of Wertheimer’s point of view for teaching are fairly 
clear. It is always preferable to proceed in a manner which favors dis- 
covery of the essential nature 
which require filling in, so that, 
the proof is “organic” rather th 


an “mechanical.” 21 
#1 Duncker (1945), page 45. 
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EXPERIMENTS ILLUSTRATIVE OF GESTALT THEORY 
OF LEARNING 


Three sets of experiments have been selected as representative of 
the topics stemming from the interests of Köhler, Koffka, and Wertheimer. 
These are: (1) experiments devised to test the hypothesis that memory 
traces change in a systematic manner with time, (2) experiments con- 
trasting organization with association in retroactive inhibition and related 
phenomena, and (3) experiments testing the relative roles of understand- 
ing and rote learning in memorization, retention, and transfer. The famil- 
iar insight experiments have already been referred to.** 


Does the Memory Trace Undergo Systematic Change? 


The position first advanced by Wulf (1922), that memory traces 
change according to the dynamic principles of organization, received 
early support from the experiments of Allport (1930) and Perkins (1932). 
Gibson (1929) found verbal factors a strong influence in producing what 
Wulf called normalizing, that is, each reproduction’s becoming more like 
a real object on successive attempts to reconstruct what was seen.” He 
therefore was somewhat critical of the theory of intrinsic factors within 
the memory trace producing the change. 

There is no doubt but that reproductions are often inaccurate and 
often exhibit the kinds of changes that Wulf suggested. But these changes 
are often found in the very first reproduction, so that the change may be- 
long more to perception than to memory. The crux of the matter lies in 
the finding of progressive changes with lapse of time. 

Most of the experiments that have shown changes demanded by the 
theory have used the method of successive reproductions, which intro- 
duces the disturbing factor that the second reproduction is affected by 
the first, the third by the second, and so on. An extreme illustration of 
changes which can be made from one reproduction to another is that 
shown in Figure 33, where each person made one reproduction based on 
Perceiving what the person before him had drawn. Mere lapse of time 
between seeing the original and reproducing it would scarcely have re- 
Sulted in, say, the tenth reproduction. 

In order‘to remedy the defect that experiments testing the theory 


82 Reyi additi xperiments on insight may be found in Woodworth and 
Schlosberg ( SOE cca Sane of the gestalt theory of insightful learning and 
its conditions may be found in Osgood (1953), pages 605-610. 

33 Gibson’s results were extended and confirmed by Carmichael, Hogan, and Walter 


(1932), Brown (1935). 
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REPRODUCTION 3 REPRODUCTION 8 REPRODUCTION 9 


PNAS 


REPRODUCTION I0 REPRODUCTION I5 REPRODUCTION 18 


FIGURE 33. Changes in figures within the method of s 


i erial reproduction, Each 
person views the reproduction by 


i y the person preceding him, and then passes his re- 
production to the next person. There are progressive changes and elaborations, as in 
the series illustrated. From Bartlett (1932 » Pages 180, 181, by permission Cambridge 
University Press, England, and The Macmillan Company, N.Y, 


used successive reproduction predomin 
ferent subjects reproduce the sai 


als of time, 
with but a single time interval 


T these condi- 
as occurred were 

» and even might be present 
when the subject copied the figure present before him.: 


34 Related experiments, with primarily negative results, are those of Irwin and 
Seidenfeld (1937), Irwin and Rovner (1937), Seidenfeld (1 


938), Goldmei 1941), 
Hanawalt and Post ( 1942), Tennies (1942), Hanawalt (1952), ii 


DE 
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Hebb and Foord (1945), in a carefully controlled experiment, used 
the method of recognition rather than that of reproduction, asking the 
subject to select from a series of figures the one that had been originally 
presented. There was only one retest for each figure learned by a sub- 
ject, but the retest was made after a lapse of time differing from one 
subject to another. The objects presented visually were a circle with a 
small portion of its circumference incomplete and an arrowhead figure. 
Presumably there would be a change either toward closure (leveling) 
or toward emphasizing the broken part (sharpening). The circle may be 
used for illustration, Testing was done by selecting from a series of 
broken circles the one which was said to have been seen before. Although 
there was some variability, the trends were not in support of Wulf’s 
theory, and there were no progressive changes evident with the passing 
of time. 

The conclusions to be reached from these experiments are certainly 
damaging to the gestalt interpretation of spontaneous changes in the 
memory trace. They do not deny the gestalt interpretation of changes in 
perception, or of progressive changes in successive reproduction, where 
Successive perception is involved. ; 

Bartlett's (1932) experiments on repeating stories over long intervals 
are perhaps more instructive in some ways than the fexpeninents) repeating 
line drawings. They reveal rather striking tendencies toward structuring 
the story so that it “makes sense,” and as the story grows older it also 
gets shorter and irrelevant details dropped out. This led Bartlett to 
speak of the productive nature of memory as contrasted with its reproduc- 
tive nature. However, the gestalt claim with respect to changes in the 
trace was based on the experiments of Wulf, and the refutations have 
been based on experiments using very similar materials. A common-sense 
use of Wullf’s classification of changes into leveling, sharpening, and nor- 
malizing has been made by social psychologists,** but these uses probably 
do not require confirmation of spontaneous changes in the trace. Krech 
and Crutchfield state, for example, that successive reproduction is char- 
acteristic of “real life,” that is, “an occasional recall (without checking on 
) of an event that has occurred in the past.” 3 
strate how a theory leads to a particular variety 
d by other theories. Despite the negative re- 
seful. Experimentation is not intended merely 


the accuracy of the recall 

These experiments illu 
of experiment not suggeste 
Sults, the experiments are u 


to confirm theories; an equally important role is to refute, to correct, 


s Eg ‘t and Postman (1947). 
g., Allport and Postman 
38 Krech ai Crutchfield (1948), page 127, footnote 1 
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and to delimit the claims of a theory. The by-products have given some 
support for gestalt perceptual theory while almost invariably refuting the 
claims respecting the nature of spontaneous changes in the memory trace 
over time. 

The methodological possibilities have no 
good ** points out a failure to use 
are known on the basis of actual p 
interpretation of the figure in dim light, etc. 


geneous items stand out like a figure on the ground, exactly as j 
tion. This favorable structuring in perception turns out to 


also for recall. Müller (1987) went on to show that the recall of isolated 
or unique material was favored even when the uni 
tributed over eight separ: 


was the unexpected recal] of w 


the rarer unique items found on] 


In the study of Köhler and von Restor (1935) an ingenious ar- 
rangement was used to test “ 


” when there was a mini- 
21 Osgood (1953), pages 590-591, 
38 von Restorff (1938), Köhler and von Restorff (1935 Bartel ü 
(1937). Aspects of these experiments were teviewèd by ke (1938) (isa Maller 
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mum of induced “set” to recall. The laws of dynamics are best revealed 
when the results of process-trace interaction emerge naturally, without 
any special instruction to “search” for relationships. 
All subjects first worked on the following computational problem: 
x= 21 (91/7 + 6) +14 


Most subjects went through the usual steps: 


First, 91/7 = 13 (1) 
Therefore, 
x= 21 (13 + 6) + 14 (2) 
= 2] (19) + 14 (8) 
Performing the major multiplication: 
21 
x 19 
189 
21 
399 (4) 
EEn x = 3899 + 14 = 413 i (5) 


At this point the experimenter pointed out, somewhat casually, that 
the subjects might have noticed a somewhat easier wiy of multiplying 
21 X 19, using the formula (20+ 1)(20 1) = 20? — = 400 —-1= 
899. The later retention test required recalling the pertinence of the 
Principle that (a+b)(a—b) =e — b, , 

After this exercise in computation, and the casual suggestion of an- 
other solution, half the subjects went on to do problems in arithmetical 
computation, while the other half worked on matchstick problems. Then 
both groups were again assigned a problem related to the one outlined 
above. The prediction was that the new problem would be detected as 
Similar to the first one more frequently by those working on matchsticks 
than by those who had continued to work on arithmetic. That is, the 
Computation problem would be more distinctive for those for whom 
it had been the only computation problem, and the new process would 
therefore be more likely to communicate with the memory traces from 
the first problem. S 

Terei was borne out. The “short-cut a S 
frequently by those whose intervening activity was with matches an 
by those whose intervening activity was additional arithmetic. (See 
Table 4.) Upon questioning, the other subjects could easily remember 
having been told about the short cut. This bears out the importance of 


method was used more 
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arranging the experiment so as to test spontaneous recall if the PiE 
of the process are to be exposed. Bartel (1937) continued with relate 
experiments, with consonant results. W f 

Müller (1937) also studied a problem familiar in the history of 
retroactive-inhibition experiments, the problem of the point in the inter- 
val between learning and recall at which the interpolated material pro- 
duces the greatest amount of interference. Miiller’s findings supported 
the view that retroactive inhibition was at a maximum near the point of 


reproduction, suggesting that the action was really proactive rather than 
retroactive. 


TABLE 4. Spontaneous Recall as Affected by Homogeneous and Heterogeneous 
Intervening Activity (Köhler and von Restorff, 1935, page 79) 


Continued with Worked with 


Arithmetic Matchsticks 
(Homogeneous ( Heterogeneous Total 
Activity) Activity) Subjects 
9 27 36 
25 10 35 
34 37 71 


Chi square = 15.3 
P less than .001 


All of these experiments show a resemblance in design to experiments 
done in the tradition of associative learning. What is unique about them, 
or about the theories used to explain them? 

First, the experiments commonly employ both homogeneous and 
heterogeneous materials, in order to point to principles of organization 
from perception as applying also to retention and recall. Added cogency 
is given to these arguments by later experiments of Werner (1947), 
where he showed that boundary phenomena which, in perception, pre- 


vent the confusion of forms, also operate in proactive inhibition. Material 
that was strongly “bounded” did not intrude 


Second, the conception of Process-trace interaction is said to differ 
from explanations according to associative connections. Köhler notes, 
for example, that in the usual retroactive-inhibition experiment traces 
are poorly distinguished and are densely crowded, so that they no longer 
function according to their intrinsic natures. Köhler points out that the 
experiments on spontaneous recall do not have these objections, for they 
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permit the traces to function more normally, with results as predicted 
from principles of organization.” 


Drill versus Understanding in Memorization and Retention 


In his book on Organizing and memorizing (1940), Katona reports 
a number of experiments inspired by Wertheimer, who contributed the 
foreword to the book. 

Katona attempts experimentally to define and characterize two 
types of processes leading to recall: rote memorizing and understanding. 
When a list of nonsense syllables is memorized, the learner is forced 
to use the former process, because there are no organizing principles 
which will permit understanding to help. On the other hand, there are 
many kinds of problems which illustrate principles; in such cases learn- 
ing by understanding will have advantages. Simple and ingenious ex- 
periments were designed in which it was possible to commit the same 
material to memory with or without understanding, and then to test 
the results on new learning. 

One experiment consisted in the teaching of simple match tricks 
of the kind illustrated in Figure 34. The problem is to move 8 lines and 
uares left. The possible solutions are shown 


in so doing to have only 4 sq 
P le principle involved in all solutions, which 


in the figure. There is a simp 
is that no side must be used for more than one square. (There are 16 


matches making the original 5 squares; because these 16 matches are 
now to make 4 squares it is evident that each side can be used but 
once.) A number of different tasks were used, all ringing changes on 
the same general pattern. 

In his Experiment A, three groups were used, a control group, a 
memorization group, and a group practiced on examples. No preliminary 
Practice was given the control group. The memorization group was shown 
the first problem (that of Figure 34), with one of its solutions. Then 
this same problem was presented in rotated form and the same solution 
shown. The memorization group was thus shown essentially the same 
Solution four times, with the problem very slightly rearranged geo- 
metrically. The group which had practiced on examples experienced six 
different transitions from one situation to another in their preliminary 
Practice period, although no general principle was enunciated. Follow- 
ing this preliminary practice, each of the three groups was given four 
tasks to solve; these were all new to the control group, but one of the four 


8° Köhler (1940), page 155. 
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tasks was familiar to the memorization and examples group. There was 
a retest four weeks later, with a new control group. In this retest three 
new tasks were presented along with one of the originally practiced tasks. 
Successes on the practiced task were about alike for the two groups, in 


Correct Solutions 
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TABLE 5. Comparison of Scores ° on New Tasks of Grou 


ps Practiced in Different 
Ways (Katona, 1940, page 86) 


Immediate Retest After 


Test 4 Weeks 
Control Group ......... wise LOG 2.20 
Memorization Group =) 79 2.84 
Group Practiced on Examples 3.92 6.24 


° Weighted scores, with possible score of 10, The differences between the examples 
group and the others are Statistically significant. 
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Why was the group practiced on examples superior? Presumably 
because the more varied experience had produced some measure of un- 
derstanding. Verbal reports showed that this understanding was frag- 
mentary, taking such forms as “filling up holes increases the number of 
squares”; “spreading it out decreases the number of squares.” In any case, 
the method produced a more varied attack, and alternative solutions to 
the familiar problems were used in a way not true for the memorization 
group. Memorization sometimes tends to narrow rather than to increase 
the range of problem-solving. 

Katona concludes that learning with understanding not only im- 
proves retention of that which is learned, but better qualifies the learner 
to move forward to new learning. Thus understanding is important for 
transfer, Some of Katona’s conclusions have been criticized by Melton *° 
on statistical grounds, but Melton accepts as demonstrated the following 


points on transfer following learning by understanding: 
has learned by rote memorization is little better 


1. A group which 
than an unpracticed contro] group in the learning of new tasks, while a 
group which has learned with understanding learns new tasks much more 
readily, 

2. When once tasks have been learned with understanding, repeated 
tests with new tasks result in progressive improvement in performance. 
By contrast, if the repeated tests are done with practiced tasks, the effi- 
ciency of performance on new tasks is reduced. 

Katona concludes from all this that there is a real difference in what 
happens when meaningless memorization goes on and when learning 
takes place with understanding. He believes that the two kinds of learn- 
ing are genuine and should be distinguished. This does not mean that 
he believes that there are two fundamental prototypes of learning, cor- 
responding to the two types of arrangement under which learning takes 
place, According to him, the underlying factor is organization. Only when 
better organization fails do you get the extreme picture of rote learning, 
which is itself a special form of organization resorted to with compara- 
tively incoherent materials. The distinction in kind is not one of ease or of 
difficulty, for it is sometimes easier to learn by rote than to learn by under- 
standing. The advantages of learning by understanding are that mean- 
ingful learning is applicable to new situations and is more enduring. 


e Katona (1942). Some features of Katona’s ex- 
epeated and extended by Hilgard, Irvine, and 
d Irvine (1954). 


40 Melton (1941). For a reply se 
Periments have been satisfactorily r 
Whipple (1953), Hilgard, Edgren, an 
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ESTIMATE OF GESTALT THEORY OF LEARNING 


Gestalt Theory and the Typical Problems of Learning 


The gestalt psychologists find a somewhat distorted emphasis in 
conventional treatments of learning, so that the problems typically empha- 
sized are not the most natural selection of problems from their stand- 
point. In order to maintain the symmetry of comparative study of the 
different positions, however, the same list will be followed which w. 
used to summarize the stimulus-response positions, 

1. Capacity. Because learning requires differentiation and restruc- 
turing of fields, the higher forms of learning depend very much upon 
natural capacities for reacting in these ways. Poor methods of instruction, 
however, may be responsible for some inability to face new situations, 
for a “blindness” which might be confused with stupidity. 

2. Practice. Changes go on within repetition, not as a result of repe- 
tition. Practically all psychologists now agree that this is so, but they 
differ with regard to the pertinent processes which go on within the 
repetitions. From the gestalt point of view, repetitions 
exposures, bringing to light relationships to enter into restructurization, 
To Koffka, they also make possible the consolidation of trace systems, 
which is as near as any gestalt psychologist comes to saying that re- 
sponses become fixed by repetition, 

3. Motivation. Goals represent end-situations, and as such modify 
learning through the principle of closure. The processes leading to the 
Successes or failures get transformed by their consequences, The em- 
pirical law of effect is accepted, but Thorndike’s interpretation of the 
blind action of effect is denied. 

4. Understanding, The perceiving of relationshi 
the relationships between parts and whole, of me 
emphasized by the gestalt writers. Problems ar 
structurally, organically, rather th 
running off of prior habits, 


as 


are successive 


ps, awareness of 
ans to consequences, are 
e to be solved sensibly, 
an mechanically, stupidly, or by the 


one situation may be applicable to another. This is in some respects like 

Judd’s generalization theory of transfer,“ There is something in com- 

mon between the earlier learning and the situation in w 

found, but what exists in common is not identical piec 
* Judd (1908), 


hich transfer is 
emeal elements 
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but common patterns, configurations, or relationships. One of the ad- 
vantages of learning by understanding rather than by rote process is 
that understanding is transposable to wider ranges of situations, and less 
often leads to erroneous applications of old learning. 

6. Forgetting. Koffka relates forgetting to course of changes in the 
trace. Traces may disappear either through gradual decay (a possibility 
hard to prove or disprove), through destruction because of being part of 
a chaotic, ill-structured field, or through assimilation to new traces or 
processes, The last possibility is familiar as a form of theory of retro- 
active inhibition. Traces which continue to exist may at a given moment 
be unavailable. While little is known about this, Koffka believes it must 
have something to do with ego organization.‘* Finally, there are instances 
of forgetting in which a process fails to communicate with an otherwise 
available trace. The forgetting of an intention * would presumably clas- 
sify here. This is also an ego problem. 

In addition to such forgetting, there are the dynamic changes which 
take place in recall, so that what is reproduced is not earlier learning 
with some parts missing, but a modified trace which is productive as well 


as reproductive. 


General Aspects of Gestalt Theory 

In discussing gestalt psychology solely as a theory of learning, some 
of its more general features have been sidestepped, especially its philo- 
sophical orientation and its relation to biology. 

The objection to association theory of learning is part of the holistic 
emphasis within the general theory, and coherent with the opposition to 
“atomistic” explanations according to connections between parts. The ob- 
jection to sensations as elements of perception (“the bundle hypothesis” ) 
is carried over in the objection to stimulus-response connections as ele- 


ments of habits. 
The phenomenologic 
gists as opposed to the pr 


al standpoint, often stated by gestalt psycholo- 
evailing positivistic position, is not easy to char- 
acterize satisfactorily in limited space. Phenomenal observation is more 
“subjective” than behaviorism, but less sophisticated and specially trained 
than the introspection recommended by Titchener, The recommended 
variety of observation is natural and childlike, appreciative rather than 
analytical. For example, Köhler’s later accounts of rink ia upon 
such a phenomenological description of events. Insight in this context in- 


#2 Koffka (1935), pages 525-5217. 
48 Birenbaum (1930). 
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cludes such everyday experiences as enjoying a cool peta We refer the 
pleasures we feel to the taste of the drink and to our thirst, through a 
natural “requiredness” of the relationships. We do not attribute our pleas- 
ure to accidental accompaniments, “not, for instance, to the spider on 
the wall, nor to the size of the chair before me, nor to the thousands of 
other things in my environment.” + These everyday experiences of the 
determination of events, here called insight, are said to refute the theory 
of chance contiguous associations.** 


The interpretation of gestalt psychology as a field theory rests largely 
on the evidence assembled by Köhler in his Physischen Gestalten (1920), 
showing the relationship between gestalt laws of organization and well- 
established principles in physics and biology. The biological emphasis 
within gestalt has been carried forward by Gelb and Goldstein.+# 

Kohler has all along held isomorphism to be central to the under- 
standing of gestalt psychology. This is the principle that the underlying 
brain fields correspond in their dynamic aspects to phenomenal experi- 
ence. He believes psychophysical isomorphism follows from the principles 


of evolution, so that mental events are not alien to physical events, but 
are rooted in the laws of nature: 


Some fundamental properties 


of nature rather than any special arrange- 
ments in the organism are, I believe, 


the counterparts of essential mental facts 47 


We shall return later (Chapter 12) 


to Kohler’s more specific hy- 
potheses about brain fields, and his experiments related to them, 


The Interrelationship Between Perception and Learning 


assical gestalt position would be incomplete 
success with which the basic thesis has been 


laws of perception and of learning are alike. 


44 Kohler (1947), page 346, 

45 For further discussion of the phenomenolo, ical position as inte ted b talt 
psychologists, see MacLeod (1947) (1951). T T e, of Saves ana Combs 
(1949) does not halong, within the classical estalt tradition, 

48 The collaboration f 


etween Gelb and Goldstein b in the study di 
soldiers of World War I, Gelb and Goldstein (1918 G ld a n 
presented in book form in Goldstein (193 


sychology picked up and extended the bi 
logical studies, but it must be considered a: } 
part of the central core of development. S; aey rather than 


affiliations can be found 
in Werner (1948), but he is not, strictly speakin » one of th ost 
‘7 Köhler (1938), page 400. a —" se Senet erase 


"s organismic 
and assimilated embryo- 
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ingfulness, and understanding, and has called attention to the importance 
of the structure of the problematic situation. The laws of similarity (or 
equality ), of proximity, of closure, and of good continuation, are not very 
convincing as laws of learning, and the attempts made by Koffka to sum- 
marize his point of view on learning in terms of them is not very successful. 
As we shall see in later chapters, the reintroduction of cognitive fea- 
tures into learning has been important, whatever one’s judgment of the 
importance of the laws of organization may be. 


Insight as an Alternative to Trial and Error 

It is implied in gestalt theory that a learner acts as intelligently as 
he can under the circumstances which confront him, so that insightful 
solution of problems is the typical solution, if the problem is not too diffi- 
cult and the essentials are open to inspection. Fumbling and trial and 
error are resorted to only when the problem is too difficult, either 
intrinsically, or because of the way in which it is presented to the learner. 
This reverses the associationist position that trial and error is the typical 
method of attack and that reasoning is essentially “mental” trial and er- 
ror. The empirical facts of insight are as satisfactory as the empirical 
facts of trial and error. Learning theorists are not yet in agreement on 
one of the three possibilities: (1) that all learning is of one kind, basically 
like trial and error, from which insightful learning can be derived; (2) 
that all learning is basically of one kind, like insightful learning, with 
trial and error a derivative form; or (3) that more than one kind of 
learning occurs, of which trial and error and insight are two illustrative 
examples. A strong case can be made for the gestalt point of view that 
blind learning is not the prototype of all learning. But this is one of the 
unresolved issues, to face us in succeeding chapters of this book. 

In any case, the gestalt psychologists haye sharpened the lines of 
cleavage in thinking about problems of learning, and by questioning most 
of what was conventionally accepted they have been of real service, re- 
gardless of whatever verdicts or compromises the future may produce. 
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Chapter 8 


LEWIN’S FIELD THEORY 


LTHOUGH a member of the Berlin gestalt group, Kurt Lewin (1890- 
Aia early began to break new ground, especially in studies of motiva- 
tion. After a number of productive years in Germany, he spent his last years 
in the United States, teaching from 1932 successively at Stanford, Cornell, 
Iowa, and the Massachusetts Institute of Technology, with a term also at 
Harvard. He became widely known and highly influential, and because of 
a warmth and infectious quality of personality developed a devoted follow- 
ing. Although his theory is not, strictly speaking, a psychology of learning, 
much of it is relevant, His doctoral dissertation, in 1917, was critical of the 
then current association theory of memorization and retention. The theory 
that he developed as a corrective was called by various names as it pro- 
gressed, such as topological psychology, but it is now usually referred to by 
its friends and adherents as a field-theoretical position. It is satisfactory to 


call it simply a field theory, without implying that it resembles in detail the 
field theories of modern physics, 


THE BEGINNINGS OF LEWIN’S THEORY 
The Lewin-Ach Controversy 


Ach belonged to the Wiirzbur: 


g school which had supplemented con- 
ventional association with sets, de 


termining tendencies, and the like, thus 
adding something more dynamic to the standard theory of strengthening 


associations through repetition, He designed a series of experiments in 
which a habit of reproducing nonsense syllables was set up through re- 
peated exposure, and then a determining tendency was introduced which 
might either facilitate or hinder the tendency created by practice, Thus 
“will” was set against “habit,” and a possible quantitative measute of will . 
proposed. 
1 Ach (1910). 
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There were three arrangements of nonsense syllables in the original 
learning experiments of Ach. One was the usual or normal arrangement of 
heterogeneous pairs, such as bol-pid; another arrangement called for a 
rhyming syllable, as rik-tik; the third called for inversion, as kep-pek. The 
subject learned one series of eight pairs of each of these kinds. The stimulus 
syllable of each pair was later used in test situations. In the test series, 
instructions were to reproduce what had been practiced, or to rhyme, or to 
invert. Thus the determining tendency could either sum with or contradict 
the associative reproductive tendency, or the reproductive factor could be 
present alone. Strength of reaction was inferred from latency of response. 
Ach found some results coherent with his theory, that the shortest reaction 
times were in the cases in which the associative (reproductive ) tendency 
coincided with the determining tendency superimposed by instructions. 
His other results were somewhat irregular, but were interpreted by him as 
giving general support to his position. 

Lewin’s objection to Ach’s work was not that he introduced the deter- 
mining tendency, which Lewin essentially accepted, but that he added it 
to the conventional association theory without seeing that it was a foreign 
intrusion and led logically to the abandonment of the simple association 
theory, Lewin argued that the conflict which appeared in the experiment 
—and, indeed, conflict was convincingly demonstrated—was not between 
association and determining tendency, but between two determining tend- 
encies, It was his belief that there is no “force” within mere association to 


lead to reproduction, that reproduction itself must be motivated and im- 


plies a set to reproduce. z 
Lewin’s experiments * were much like Ach’s, but a few modifications 


greatly changed the resulting interpretations. First a series of eight pairs 
of nonsense syllables was learned in which the second syllable rhymed 
with the first, dak-tak, ged-ked. Then a list of eight pairs was learned in 
which the response syllable was the stimulus syllable spelled backward. 
Note that the only change in response syllables was in the consonants, the 
vowels being alike in both the rhyming and the reversed pairs. 

The first test series consisted in instructing the subjects to respond to a 
Set of syllables by changing the vowel. Syllables from the previously prac- 
ticed series were inserted among unfamiliar syllables. if Ach's theory were 
sound, there should have been some conflict between “habit” and deter- 
mining tendency,” but none was found. There were no. wrong reactions, 


and the Jatent times for control and critical syllables were alike. 
A second test, however, produced conflict and retardation, such as 


? Lewin (1917) (1922). 
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Ach found. When the subjects were told to rh 


yme a series including both 
syllables previously 


rhymed and syllables previously reversed, mistakes 
were made on the syllables previously reversed. Similar] 


y, with instructions 
to reverse, 


errors were made in response to syllables previously rhymed 
when they were presented in the midst of syllables previously reversed. 
The interpretation is that the instruction to thyme to syllables previously 
learned as rhymes was acted upon as though it had been an instruction to 
reproduce. When, then, a previously reversed syllable appeared, the faulty 
self-instruction to reproduce conflicted with the correct instruction to 
rhyme, and interference was shown. 
The occurrence of conflict in one case and not in the other, although 
the prior habits fixed by association were equally strong before both tes) 
confirmed Lewin in his belief that the existence of an associatio 
later terms, a cognitive structure 
activity. There is always 
the activity of reproduci 
Lewin’s contention, while it remain 


n (or, it his 
” for Mantal 
activity, inclut ding 
syllables, N 
ce 
ind 
sems, For ehan pha 3 
ugh repeated rein- 
aks through instruc- 
nflict between “habit” 


to habits more intimately tied u 
a conditioned avoid 


forcement, has a certain urgency about it, so that it bre. 


ssible to treat the cor 
in this case much 
Ach’s position.: 


spond. This “habit” is more than 


ading to reproduction of associated items, 
This early recognition of the dynamic organization of reproductive 
tendencies is reflected in later experiments on the relationship between 
tension and retention, as shown j ed and unfinished 
tasks,* and the tendency to resum 

arises." 


Psychological Tension Systems 


Much that foreshadowed his later s 


ystematic writ 
important paper by Lewin in 1926, 


ings appeared i 
shortly after K aP en 


öhler’s and Koffka’s 
? Hilgard ( 1938). 

1 Zeigarnik (1927 NF 

5 Ovsiankina ( 1928). 
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views were becoming known in America. Lewin’s position was first intro- 
duced to American audiences by J. F. Brown (1929), three years after this 
paper appeared. 

For one thing, Lewin set himself squarely in favor of a psychological 
analysis of the actual situation, and against what he called accomplishment 
concepts. The practice curve for typewriting, scoring typing according to 
words typed per minute or some other convenient unit, rises, shows a 
plateau, reaches a limit, and so on—as though all that happened could be 
described as “typewriting.” A more psychological analysis would show 
that what the beginner is doing and what the skilled typist is doing are 


entirely different: the beginner's process is one of searching, but the 
kill is mastered. This objection to accom- 


h Guthrie’s objections expressed in much 
è The difference is that Guthrie wishes 
ts, which Lewin would class with 


searching process drops out as s 
plishment is to be compared wit 
the same way, several years later. 
to restrict his measurements to movemen 
accomplishments as lacking in sufficient psychological meaning. 

The typical non-gestalt conception of psychological causation, says 
Lewin at this time, is adhesion. One thing is attached to another so that 
revival of the first brings forth the second. Gestalt psychology is sometimes 
misunderstood as correcting association by accepting the adhesion prin- 
ciple but permitting it to apply only to parts of wholes, This notion is flatly 
rejected, No mere coupling principle can provide the energy for psychical 
activity, Psychological behavior depends upon energy related to psycho- 
logical tension systems. ; 

Perception of the world of meaningful things and events does not pro- 
vide the energy for activity, although activity aroused under need-tension 
Systems may be stecred by perception. This is not unlike Skinner’s later 
Notion of the role of the discriminated stimulus in operant behavior: the 
stimulus does not elicit the behavior, but it does set the occasion for it. The 
Sequence of events from perception to satiation is as follows. The percep- 
tion of an object or event may give rise to a psychological tension (e.g. 


go 
desire), or it may communicate with a state of tension already existing in 
such a way that this tension system iera pu assumes control.oversmotor 
behavior, The aroused “valences” (attractions and repulsions of perceived 
goal objects) act as environmental forces steering subsequent behevitr, 
This behavior then leads to satiation or to the resolution of tension so 


that a state of equilibriu 


At this early date (1 
ridiculousness for scientific purposes of s 


a 


m is approached. 
926) Lewin went to some pains to point out the 
aying that “everything depends 


€ Guthrie (1935), page 163. 
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upon everything else.” While as a gestalt psychologist he was concerned 
"with wholes, he recognized as the most important problem the isolation of 


the “specific psychical units, personality spheres, and behavior wholes” in 
which one’s activities, emotions, intentions, wi: 


bedded. Whatever psychical unity there may b 
a “weak” gestalt, and we are dealing in the personality with a great number 
of “strong” gestalts which in part are in communication with each other, in 
part disclose no genuine unity at all. In adults the Possibility of A thetanized 


behavior depends upon a relatively complete segregation ofan 16 stionr of 
different tension systems. umbe 


shes, and hopes are em- 
e in the “ego” system, it is 


ie Wgh 
en. He arrives at the con A nowa B 
, because their satisfaction is like the sa i l, E is 


stitute satisfactions may be fo 
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TOPOLOGICAL, VECTOR, AND HOD 
PSYCHOLOGY 

The more abstract and formal chgracteristics of Lewin’s system were 

detailed in a book and a monography Principles of topological psychology 

(1936), and‘The conce F i 


ptual representation and measurement of psycho- 
logical forces (1988). He later stated the re] 


‘i eee ‘ationship of his viewpoint to 
problems of learning. 


OLOGICAL, 
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for a given set of data. In looking for a “fit” to his conception, Lewin hit 
upon topology and upon vectors. 

Topology is a non-metrical geometry of spaces, in which concepts 
such as “inside” and “outside” and “boundary” are used. Lewin believes 
that many psychological situations are structured in that way) Sociologists 
have long talked of “in-groups” and “out-groups.” It is such a usage which 
Lewin is trying to formalize in topological concepts. 

Vectors are borrowed from the mathematical system used in me- 
chanics to describe the resolution of forces. Unlike topology, vectors are 
metrical, A vector is usually represented by an arrow, the length represent- 
ing its force, the direction representing the line of application. The point of 
application may also be shown as a matter of convenience, but it does not 
belong as a property of the vector. Because much dynamic psychology can 
be described in terms of conflict and the resolution of conflict, the geometry 
of vectors finds appropriate application. 

His desire for a “qualitative” geometry having some of the character- 
istics of topology with a “quantitative” geometry of vectors led Lewin to 
suggest that a new geometry was needed to fit the problems of psychology. 
He specified some of its characteristics, and named it the geometry of 
hodological space. The root is from the Greek word hodos, meaning “path.” 
Hence hodology is to be a geometry of paths, somewhat as topology is a 


geometry of spaces. f 

-What Lewin intended by borrowing from topological and vector 
geometries is clear. He was searching for a mathematics which would be 
appropriate to psychological structure as he understood it. He has been 
attacked for using these mathematics incorrectly and for drawing im- 
proper analogies from them.$ His system is no doubt vulnerable in some of 
these respects, but the relevance of his theories for the purposes of the 
s not rest upon the validity of his quantitative rep- 


present discussion doe 
resentations. 


The Psychological Field: Life Space 
The selection of geometry as an appropriate mathematics is natural 
because the psychological field is thought t of as a space in which the person 
moves. This space is psychological, not physical; it is represented mathe- 
matically by a spatial diagram, but that does not mean that it exists in 
physical space. 
The conception of life spac 


8 London (1944). 


e is a plausible onel It signifies that two 
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people walking down the same street are going different places, and the 
worlds in which they are walking are to them different worlds. Or as I sit 
in reverie and make plans for tomorrow, I move in a world very different 
from that in which I sity life space is the space in which I live psycho- 
logically, as seen from my own viewpoint. It includes each and every ob- 
ject, person, idea, that I have anything to do with.jIt corresponds in many 
ways to the world about me, to the world of things, people, and ideas, 
being my selection from these, but it becomes my world always in edited 
and distorted forms. It is possible to distinguish between me and my life 
space, for I move about in my psychological space as others do in theirs. 

e interact in the real world, all the while producing changes in our re- 
spective psychological worlds. (Because of correspondences between the 
psychological world and the real world, Lewin spoke of life space as quasi- 
physical, quasi-social, and quasi-conceptual.\ Life space is conditioned 
and influenced by the physical, social, and conceptual environment but 
cannot be identified with that environment. ` 


Life space may be inferred; it does not d 
That is, although life space is psychological, 
available only through introspection. A pers 
spect about the forces acting in his life space 
is a construct, like other scientific construct 
psychological situation at a given moment. 


FIGURE 35. Life-space. The situation represented is that of a boy who wants to 
become a physician. The person (P) is shown separated from the goal region (4 G) 
by a number of regions with boundaries between them. The regions are college en- 
trance (ce), college (c), medical school (m), internship (i), establishing a ek 
(pr). While the locomotion Hiong 


h these regions may take place in the 
are in the present life-space as obstacles to the attainme 


ission from Principles of topological psychology Wen, YH 
in view. By permission from Principles of topo ogical psychology, by K, ` 
right 1986 by McGraw-Hill Book Co, Tae Psychology, by K. Lewin, Copy. 
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motions are not to be identified; in life space we represent the psycholog- 
ical significance for the person of the motion in physical space. A typical 
life-space diagram is shown in Figure 85. aa 

Occasionally life space corresponds very closely to the real world 
with which the person is in commerce. In that case he is said to be in 
touch with reality. At the other extreme, we find individuals living in a 
world of fiction and fantasy—out of touch with reality.(In order to describe 
these differences, Lewin proposes reality-irreality as a dimension of life 
space. The very young child may not distinguish between his hopes and 
wishes and the actual circumstances of life. Presumably there is a greater 
differentiation between reality and irreality as we grow older, although 
normal adults are still capable of wishful thinking. 


“The Person in Life Space 
The person is often represented as a point moving about in his life 
space, affected by the pushes and pulls upon him, circumventing barriers 
in his locomotion in his own life space. If the person is to be treated as a 
particle in a field, this representation as one point in space is a convenient 


one, 
resented in a different way, as a differentiated 


The person is also rep! 
region in life space, the person having a structure of its own. There are 


more superficial and deeper (less accessible) layers of the personality; 
there are different degrees of complexity of organization and differentia- 
tion; some parts are in communication, while other parts have strong bar- 
riers between them. The structure of the person is sometimes diagrammed 
as in Figure 36. As Leeper ° points out, a great deal of the experimental 
work of Lewin and his students has dealt with constructs related to the 
person, including studies of interrupted tasks, satiation, anger, and rigid- 
ity, 

ot very clear about the exact relationship between the 
ace. Deutsch * believes that Lewin used the term 
“person” in three ways, though Deutsch thinks Tain might not have 
agreed with his analysis. One use is as the behaving person in life space, 
essentially a geometrical point upon which the forces diagrammed in life 
space impinge, determining its locomotion. A second use makes the prop- 
erties of the person, in interaction with the evinces, produce life 
space, His needs, beliefs, and values, determine how life space will be 


Lewin was n 
person and his life sp’ 


® Leeper (1943), page 62. A summary of earlier work was called A dynamic theory 


of personality (Lewin, 1935). 
19 Deutsch (1954), pages 190-191. 
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structured for him. The third possibility is that the person and life space 
are equivalent, being two ways of representing the same facts. That is, 
the diagram of Figure 35 represents the changes in the psychological en- 
vironment concurrent with the changes in the tension systems shown in 
Figure 36, and these are two aspects of the psychological situation at the 
time. Lewin’s own preference would probably have been to combine the 
first and third of these ways of considering the person, that is, he would 
conceive the “person” as the behaving person, but he would have the be- 
having person represent at once a differentiated region in life space, 


FIGURE 36. Representation of the person. The motor and 
because they have commerce with th 


boundary zone (M). The p i i inner-personal region 
(I), which has more centra parts (c) and more peripheral parts (p). Under dif 


ent circumstances of stress, th 
dedifferentiation (“primitivatio 


chology, by K. Lewin, Copyright 1936 by McGraw-Hill Book 
_ Valence 


When a person is attracted by an object that ob 
positive valence, an analogy with the 
The person tends to move toward a regio 


pelled. That is, objects and activities thai 
valence, and they lead to abient activity. 
Because (life space may contain re 
at once, these give rise to conflict, especially when the o 
are approximately in balance. Lewin specified three chie: 
flict; 1 
vi. Two positive valences. A child has-to choose, for example, between 
going on a picnic and playing with his comrades, 


11 Lewin (1935), pages 88-91. The examples cited are his, 


gions with several valences active 
~~ vale 


Pposing forces 
f kinds of con- 
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a% A simultaneous positive and negative valence. A child is offered 
a reward for a school task he does not wish to perform. 
ő. Two negative valences. A child is threatened by punishment if he 
does not do a task assigned to him. 
The experimental analysis of conflicts of these types has proven fruit- 
p.s 
The assigning of the valence to the object represented in life space 
does not, of course, imply that the valence actually inheres in the object. 
an inverse-square law applies to valence (“the force 
of attraction varies inversely as the square of the distance”), the theory 


requires that the distance must be measured psychologically, just as the 
object as perceived by the person.(The same lee 
lences for different people because each of 


While something like 


object carrying valence is the 
object will carry different va 
them perceives it differently. 


Distance and Direction 

Life space is quasi-physical, so sometimes there is a gross corre- 

spondence between locomotion in life space and in physical space. A 

conflict between two objects of which one is on the person’s right and 
n in opposite directions, will be repre- 


One on his left, inviting locomotio š ; 
sented in a life-space diagram by regions and paths in opposite directions. 
When physical space and life space are in close correspondence, physical 


distances and directions may be used for experimental purposes as ap- 
Proximations of distances and directions in life space (that is, Euclidean 
Space is used then to represent hodological space). 

But life space often does not correspond to physical space. A choice 
that is arrived at through reflective thinking is a locomotion in life space 
quite as genuine as a locomotion involving gross muscular movement. 
In such cases, where there is no physical counterpart of locomotion, the 
Specification ot distance and direction becomes very difficult. 

Lewin had little to say about distance. If one path includes another 
on the way to a goal, it is evident that the included path is shorter than 
the entire path.?* That is all that we can say about distance, except in cases 
in which we assume a correspondence between environmental (physical) 


Space and life space. 
_ _ Lewin gave a fuller trea 
is defined by the start of the p 


tment to direction. The direction of a path 
ath, “by the beginning step of the distin- 


s (1938a) (1933b), and the later ones by 


12 l aes Fajan: 
See the early experiments of Faj land (1941), Barker (1942) (1946). 


Hovland and Sears (1938), Sears and Hov 
18 Lewin (1988), page 50. 
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guished path.” 1+ There are four possible directions of locomotion. The 


letter d, with appropriate subscripts, serves to symbolize the path whose 
direction is defined: 

(1) The direction of the distinguished path from A to G would be dac. To 
indicate a particular route from A to G, the expression might be d4 n.;, the ad- 
ditional intermediate letter or letters indicating distinctive portions of the path. 

(2) Sometimes the person tends to stay in the present region (continue his 
present activity). The direction of this tendency is d4 4. 

(3) Conversely, the direction of a tendency to leave the present region in 
so far as this tendency involves a disregard of what other regions are moved 
toward, or the direction of a tendency not to enter 


the region A, would be dy as 
(4) Finally, the tendency sometimes is to increase the distance from a not- 
present region. This tendency has the direction da,- 


Lewin showed the applicability of his method of representation to a 
variety of experiments often studied in the learning laboratory: maze 
learning (familiar and unfamiliar mazes, mazes with one-way doors, 
T-mazes), the roundabout-route experiments, the use of tools, the obstruc- 
tion box, extinction of lever-pressing, goal-gradient behavior, various 
conflict problems. Whether or not his theory is a “learning theory” its 
relevance to learning is established by these applications. 


An Example of the Representation of Experimental Results 


full (1933) and Leeper (1935a) performed similar experiments in 

which they taught rats to turn one way through a maze when hungry, 
another way when thirsty.)A rather slight difference in physical arrange- 
ments made an enormous’ difference in the ease of acquiring the habit. 
Hull's rats took 25 periods of 8 days each (a total of 1000 trials at 5 trials 
per day) before achieving about 80 per cent correct choices. Leeper’s 
rats performed as well after a single 8-day period (a total of 40 trials) 
Hull's animals ran to food or water in a common goal-box, finding the 
door closed if approached from the wrong side. When hungry they could 
enter the goal-box and find food only if they ran to the left; when thirst 
they could find the door open and water in the goal-box only if they ran 
to the right. (I am disregarding the appropriate balancing out of right 
and left in the careful design of the experiment.) Leeper’s rats ran in- 
stead to separate end boxes on the food days and the water days. Both 
boxes were open, so that if they ran the wrong way on a water da 
they found food in the end-box they entered; conversely, if they took 
the wrong path on a food day, they found water in the box entered, 

14 Lewin (1938), page 27. 

15 The summary is quoted from Leeper (1943), page 125. 
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Let us now see how Lewin diagrammed this diference 

Lewin chose to represent the two experiments by diagrams spatially 
similar to the actual experimental arrangements, slightly stylized by con- 
verting mazes with right-angle turns into circular ones (Figures 87 and 
38). There is great freedom in the construction of topological or hodo- 
logical representations. Hull's problem has to be represented, however, as 


Gf Gw 


FIG. 38 


FIG. 37 


9 E ation of the situation in Hull’s experiment (Hull, 1933). 
Piaf sie ies box; Gr, food goal, reached on hunger day by path A, 
E,B; Gw. water goal, reached on thirst day by path A,D,B; fue , force generated by 
valence of food; fosy force generated by valence of water. Entrance from the in- 
correct side was blocked off by a closed door at the goal-box. (Modified slightly from 


Lewin, 1938, page 182.) 

F! wntation of the situation in Leeper’s experiment (Leeper, 
ime 58: Represo T goal box in which food was also present, whether the 
animal : Rance thirsty; L, goal box in which water was always present, whether 
the adima] ae ry or thirsty; Gr, food goal, reached by path A,E,K; Gw, water 
goal, a yas nen A,D,L; faces force generated by valence of food; fa,cy, force 
Seren bpp of "water. Entrances to both goal-boxes always open. (Modified 
slightly from, Lewin, 1938, page 182 


two paths to a common goal (Figure 87), while. Leeper's must be repre- 
sented as two paths to different goals (Figure 38). Leeper S problem could 
have been diagrammed as a T-maze or @ Y-maze peenis doing violence 
to its hodological representation. The important s nr between the 
two figures lies in the forces set up by the highly valence goals: 


is instructive to compare his explanation with 
10-i and that by Hull (1952a), page 140. Both 
fluence to the “Jatent learning” resulting from seeing 
en not thirsty. In his more formal treatment, Hull 
Its were alike, thus ignoring the 25:1 difference in 


tal arrangements. 


16 Lewin (1938), pages 
that by Leeper (1935a), pages. 

eeper and Hull attribute some in! i 
‘ood when not hungry and water wi 
wrote as though the Hull-Leeper res 


difficulty between the two experimen 
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Of the differences between the experiments, Lewin writes: 


For determining the direction of the forces facr and f al : 
the experiment of Hull, there remains the question of determining the relation 
between the directions dy p and dy p. In Leeper’s experiment both are clearly 
opposed. In Hull's experiment we have to deal with a rather complicated rela- 
tion between the directions of two paths from the same starting region to the 
same end-region. By discussing the geometry of the situation we have seen that 


those directions have to be considered “partly equal.” That would mean in our 
case: 


‘A,Gyy at the point A in 


dap (=) dax 


which involves both a certain amount of 
tions. We expected that the difference of 
portant the greater is the potency of th 


equality and difference of these direc- 
both directions should become more im- 


e means, relatively to that of the ends. 
Indeed, I would assume that one of the reasons for the difficulty in Hull’s experi- 
ments is that the regions E and D are relatively slightly emphasized means to a 
common end, 


At any rate, the directions dap and dy x are not clearly opposed to each 
other, as in Leeper’s experiments, In addition, the force fa.cp in Hull’s experiment 
points in two directions, dy » and dy». Also the force fa,cw points in these two 


directions. In Leeper’s experiment each of these forces points only in one of 
these directions,17 


Apart from pointing out the obvious differences i 
rangements, Lewin proposes two explanations for the 
ing by Leeper’s rats than by Hull’s: (1) the two 
ated for Hull's rats, and (2) the forces deter 
starting point are more nearly in balance 
Lewin’s explanation fails to tell us how 
number of features of the experiments; 


n experimental ar- 
greater ease of learn- 
paths are less differenti- 
mining the choice at the 
e for Hull’s rats. Unfortunately, 


he would propose to handle a 


How does experienced goal satisfaction or frustration 
structure on the next trial? 

How does the state of hun 
rat, or the forces or v 

Of what import: 

If the forces ac 
solve the problem? 


affect the cognitive 
ger or thir: 
alences operating? 
ance are the closed doors in Hull's experiment? 18 

ting on Hull’s rats are in balance, why do the rats eventually 


st modify the cognitive structure of the 


Lewin succeeded in makin 
difficulty for Hull’s rats than fo 
in any detail for the findings, 


g it plausible that there should be greater 
r Leeper’s, but he clearly did not account 


One can only conclude that, if other ex- 


17 Lewin (1938), page 183. 
18 Lewin noted their importance, and ave an illustration fro, 


een Hull’s experiment and Li 
Lewin (1938), page 181. But the closed doors do not ap 
the experiment. 


rom human behavior to 
eeper’s seem less paradoxical, 
pear in the formal treatment of 
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planations are as incomplete as this one, distance and direction are left in 
an unsatisfactory state.*® 


Behavior as a Function of the Present Life Space 


Perhaps even more forcefully than his gestalt colleagues, Lewin in- 
sisted that behavior depends upon the present, not upon the past or the 
future. Past events, like future events, do not exist now, and therefore in 
effects now. While past psychological fields are part 


his sense, cannot have 
resent field, their relationship to the present is so 


of the origin of the p 


indirect that their explanatory value is slight. 
As pointed out earlier, the notion of contemporary causation is a 


sound one, and need not lessen the importance of a knowledge of the past 
in contributing to an understanding of the present. It is true, however, 
that in adopting a logical position there often go along with it certain 
overtones of interest and preoccupation not essential to the position. 
Lewin’s whole-hearted insistence on the importance of the present ac- 
centuated his interest in conflict and other contemporaneous motivational 
situations, and lessened his interest in progressive changes, such as those 
studied in conventional learning situations or in the psychological clinic. 

Lewin justifies his neglect of learning on different grounds. He says 
that learning (i.e., the influence of the past on the present) has been much 
overemphasized as a reaction against earlier prescientific teleology (i.e. 
the influence of the future on the present). Field theory’s emphasis on 
the present he believes to be a necessary corrective, establishing a pre- 


ferred balance of interest.” 


The Time Dimension 

Despite his preference for dealing with present causes, Lewin was 
not, of course, oblivious to the time dimension. Decade learning theorists 
tend to be preoccupied with changes that take place in time, it is of in- 
terest to examine Lewin’s position, 

In a discussion defending the principle of contemporaneous causa- 
tion, at the same time showing that he was aware of historical factors, 


Lewin had this to say: 

A z inciple of contemporaneity as one of the basic 
T Without ae Ae PE E to realize that to determine the psychological 
Le ioe e Tacit of behavior (i.e., what is usually called the “meaning of 
the payeuai geal events we have to take into account in psychology as in 


19 This conclusion was reached also by Leeper (1943), be then suggested im- 
provements that would require major changes in Lewin’s system. 
20 Lewin (1942), page 222. 
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i i ds in psychology 
in ti i length of this period depen: sycholog, 
ST ba eean fie ee a male, the EA ne ae etuaan F 
aa enopes i the longer is the period which has À 
saa ana el ea me velocity of behavior at a given = 1 l 
pe epted from L. K. Frank (1939) the dimension of life pe 
in acc , : 
att A ieee which includes the psychological an i ip 
a a cme The small child lives largely in the present, an 
psycho : 


i dimension in life space. The sequence of events starts at 
AYE tig Note the movement of the person (x) along path ps 
ag a segment of time, and along path pS in a larger segment of time. Lewin 
in a 
(1943), page 302. 


he grows he conforms his behavior to events in the m 

Part of the overemotional reactions of adolescents m 

attempt to face adult problems before they have sufficie 

Then they interpret a momentary frustration as an e 
21 Lewin (1943), page 301. 


ore remote future, 
ay be due to their 
nt time-perspective. 
nduring defeat, 
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matic device was found for adding the 
time dimension to life space. One kind of diagram resembles those used 
to show the pathways through the spinal cord. Then the time dimension 
can be thought of as the long axis of the cord, the cross sections corres- 
ponding to momentary life space. Such a diagram is given in Figure 39. 
Another diagram presents life space captured at various segments 
of time, as though life space moves along on a continuous belt and can be 
examined at any moment. Such a diagram, for life space in relation to a 
planned future event, is given in Figure 40. In this diagram there appears 
also a vertical dimension, indicating the difference between life space on 
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No very satisfactory diagram 
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FIGURE 40. Time perspective in planning. The sequence of events starts at the 
left and moves to the k; sht. Each segment is as seen by the person at the time repre- 
sented. The vertical Jimension is that of irreality (wish; fantasy). The coming to- 

. The vertical d s t there is less difference between 


ether of t > Jevels in the future means tha! s e 
reality and ee = something planned is more nearly achieved. Note that at earlier 


stages the wish makes the person appear to be nearer to the goal in irreality than in 
akes ; j 
reality, From Barker, Dembo, and Lewin (1941 ), page 210. 


and on the level of reality. Note that the 


the plan i au (fantasy) 
ane of irreality (fantasy? 
I irreality ( ars in present life space, but life space 


future, as now perceived, appe 
itself changes as time moves 0” 


‘AppLICATIONS TO LEARNING 


ange in Cognitive Structure 
reases in knowledge. What does it mean to 


know more? It means to have à more highly differentia ted_life space, in 
. nnected by defined paths. That is an- 


which there are more subregions cor : i ; 
other way of saying that we know facts in their relationships; we know 


what leads to what. i i 
A problematical situation represents an unstructured region of life 


space, We do not know how to get from the givens to the goal. We feel 


wit 


Learning as a Ch 


As one learns, one inc 
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insecure until the region becomes structured. When it does become struc- 
tured so as to permit problem solution, we have learned. 

a change in the structure of knowledge (cognitive structure) may 
occur with repetition, The situation may require repeated exposures before 
the structure gets changed. The important thing is that the cognitive 
structure gets changed, not that the repetitions occur. With better arrange- 
ments of the problem the structure may get changed with fewer repetitions. 
This-is the lesson of the insight experiments(Too much repetition does 


not aid learning; on the contrary, repetition may lead to psychological 


satiation ° with accompanying disorganization and dedifferentiation of 
the cognitive structure. ) 


Changes in cognitive structure come 
principles of patterning in perception, 
changes are due to “forces” 


about, in part, according to the 
as previously discussed, These 
intrinsic to the cognitive structure, But 
cognitive structure is also changed according to the needs of the individual. 
A psychological force corresponding to a need can have either of two 
consequences. It can lead to “locomotion” in the direction of the force. 
This means, in common language, that the need is satisfied in familiar 
ways; new learning is not required, and the cognitive structure m 
intact. Or the force can lead to a change in cognitive struct 
such a “locomotion” may be facilitated. That is, the relations 
the situation are seen in new ways, so that the need may be 
the latter case, motivated learning may be said to have taken 


ay remain 
ure so that 
hips within 
satisfied, In 
place. 

VA Reward and Punishment 


Those who accept law of effect or reinforcement theories have seldom 
analyzed in detail what the circumstances are which force the learner to 
confront the reward or the threat of punishment. Lewin’s treatment of 
the conflicts involved is recognized by at least one of his critics as an 
advance over traditional law-of-effect thinking.” 

(The usual arrangement in punishment is illustrated in Figure 41. 
The threat of punishment (Pu) is used to keep the learner at an intrin- 
sically disliked task (T). A conflict is set up so that the individual is forced 
to choose between one or the other of the disagreeable possibilities, Under 
these circumstances, the tendency is to “leave the field,” to avoid both 
of the tasks. In order to keep the learner in the field of the conflict it is 
necessary to erect barriers (B). These in actual life are usually authoritar- 
ian; it is necessary to “police” a learning situation controlled by punish- 


22 Karsten (1928). 
23 Estes (1954b), page 340. 


‘to leave the field is not present as it is in 
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ment. Because association psychologists have tended to perform their 
experiments in confining runways or with subjects strapped to the ap- 
paratus, they have usually overlooked important dynamic arrangements 
of their experiments. The importance of the barrier is not incorporated 
in their theories. 


BARRIER 


FIGURE 41. Life space in threatened punishment. The person is caught between 
the tendency to onl ai unpleasant task (T) and the threat of punishment (Pu). 
Because of the conflict there will be a tendency to leave the situation (1) unless 
there are strong barriers to prevent this. Hence the punishment situation to be effec- 
tive, is necessarily prison-like. ‘After Lewin (1942), page 233, by permission of the 


National Society for the Study of Education. 


In a reward situation it is not necessary for the learner to be “walled 
in” because the attractiveness of the reward keeps him in the field. It is 
necessary, however, to keep a barrier around the reward, to prevent access 
by any route (such as W) other than by going through (performing) 
the disliked but requested activity (Figure 42). Because the reward is 
externally related to activity T, there will always be a tendency to take 


is i i 7 he 
F x in offered reward. The person is in conflict because tl 
ap ae ee ake Doe by ae of the unpleasant task (T). While the tendency 
ach to rewded RNASE punishment, there is a tendency to approach 
i by path Wagn. Hence it is 
e r hortcut so as to avoid T, as by 

TE DADE roe reward to prevent approach by any path other than 
that through T Some policing is necessary, but the situation is less prison-like than 
inh erence = unishment. After Lewin (1946), page 811, reproduced 
y permission from Manual of child psyc 

y John Wiley & Sons, Inc. 


chology edited by L. Carmichael, published 
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short cuts, if possible. Cheating by school children is such an effort to 
obtain the reward without performing the demanded work. 

The differences in structure between reward and punishment situa- 
tions are real. One consequence of the different amounts of constraint in 
the two situations is that(rewarded activities often become interesting and 
liked, so that motivation is no longer extrinsic, whereasfactivities controlled 
by the threat of punishment tend to become increasingly hated. ) 


Success and Failure 


A When goals are intrinsic, it is more appropri 
activity as successful or unsuccessful rather than as 
The contrast between a “psychological” approach to psychological 
problems and a “non-psychological” approach is nowhere clearer than 
in the difference in treatment of reward and success. @ „reward is some- 
thing tangible and external, which merely terminates a situation (Guthrie) 
or is associated with need reduction (Hull). A success experience has to 
be understood according to what the learner is trying to do, and (he 
relationship between success and goal-achievement is a somewhat com- 
plex affair. Psychological analysis offsuccess from the point of view of the 
learner shows at least the following possibilities: 25 
1. To reach a goal constitutes success, Thi 


of success. It is obviously satisfying to try for something and to achieve it, 
“2. To get within the region of the goal may be a success experience, 


Some goals are less well defined than others, A student may hope for an 
A grade, but be satisfied with a B; a person may wish an office, but take 
satisfaction in the fact that his fellow-members nominated him. The 
curious practice in India of signing a letter with the 


the signature “B.A., failed,” testifies to the fact that getting near enough 
to the goal to take the examination is a mark of esteem. 

3. To make noticeable progress toward a goal m 
experience, even though the goal is remote. To pa 
is not to be admitted to medical school, but it is a ste 

4. To select a socially approved goal may in i 
perience. To be the kind of student who carries 
Christmas holidays is a mark of serious intent and i 
even though the books remain unopened. To own 


ate to think of the goal 
rewarding or punishing. 


s is the usual interpretation 


statement following 


ay provide a success 
ss freshman physics 
p along the Way. 
tself be a success ex- 
books home over the 
s bolstering to the ego, 
an encyclopedia is to 
1 Lewin (1942), page 233. For a fuller discussion of 


promised reward ; at- 
ened punishment, see Lewin (1935), pages 114-170. rd and threat 
26 Hilgard (1942), 
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be the kind of person you would like to be, whether or not the encyclo- 
pedia is consulted. 

If such matters are talked about in the language of primary and 
secondary reinforcements, some of the vividness of the learner-goal re- 
lationship is bound to be lost. It is to Lewin’s credit that he made a serious 
effort to deal with psychologically real problems, and then, as a second 
step, tried to find a mathematics appropriate to them. Too often the al- 
ternative is chosen of looking first for something which can be measured 
and counted, hoping that psychologically more interesting relationships 
can be deduced from them later on. This leads to preoccupation with 
nonsense syllables, mazes, and conditioned responses, to the neglect of the 
problems of behavior in freer life situations in which the learner selects 
his own goals and redefines them as he goes along. 

The experiments which pointed up the concepts used by Lewin and 
his students in discussing success and failure were those of Hoppe (1930) 
which introduced the companion notions of(ego involvement and level of 


aspiration. } 


(Psychological success and failure depend upon ego involvement in 


the task at hand. That is the goals must be real to the learner, so that, if 
achieved, there is the elation of significant accomplishment; if not achieved, 
there is a chagrin or humiliation of defeat. As William James put it, our 
self-esteem may be expressed as the ratio between our success and our 
pretensions.”° 7 r ; i spe 
Hoppe pointed out that there is a restricted region of difficulty within 
which success and failure are possible (Figure 43). Some tasks are much 
too easy. To succeed in them is not to experience success. A school child 
is insulted if asked to spell words which are too easy or to make computa- 
tions interpreted as “baby stuff.” Some tasks are much too hard. If we 
are asked to give a € onstruction in Russian grammar—and we make ule 
pretense of knowing Russian—this is not a psychological failure. i is 
only within the range of uncertainty—where both Snocess and failure 
are possible—that we can really succeed or really fail, Experienced teachers 
know how hard it is to keep tasks S A appropriate level of difficulty so 
y ae ain ego-involved. 

that ba des ons oa zet P ES goals within the range of activities 
ego involvement. The momentary goal i$ referred to as 
g! fit is set according to the learner’s interpretation 
t, but there are wide individual differences in 


in which there is 
the level of aspiration. 
of his own achievemen 


26 James (1890), page I, 310. 
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the manner of such goal-setting. Some learners are realistic, and set their 
goals near to what they have shown themselves capable of doing. Those 
who are unrealistic may depart in either direction from their past achieve- 
ments. Some are self-protective, and to avoid the possibility of failure set 
goals too low; others are hopeful, and set goals unrealistically too high.*) 


Requires Achievement 
Much too 
difficult 
Very difficult 
Zone of task level and level of Difficult Limiting zone of 
aspiration within which experiencies Medium individuals. capacity 
of success [3] and failure Ẹ] occur toe y 5 v: 
Much too j 
easy F: l; 
FIGURE 43. Difficulty-level and the experience of success and failure. The sycho- 
logical experience of success and failure can occur only in the region in which ‘there 


is some possibility of objective achievement or lack of achievement, By permission 
from A dynamic theory of personality, by K. Lewin, Copyright 1935 by McGraw- 
Hill Book Co., Inc. 


The interrelationships among past achievements, 
social atmosphere, and individuality, set a number of inv 
problems.” The flavor of these studies is legitimately c 
of studies on reward, punishment, and reinforcement, which represent the 
approach to the same subject matter taken by the stimulus-response psy- 
chologists. Regardless of how one may view the conceptual system within 
which the level-of-aspiration studies are framed, it is evident that they 
call attention to aspects of goal-directed activities which have been neg- 
lected by those preoccupied with reward and punishment, 


27 P, S. Sears (194G) (1941). 
28 For reviews of experiments concerned with level of 


aspiration, see J. D, Frank 
(1941) and Lewin, Dembo, Festinger, and P. S. Sears (1944). J ae 
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Learning as a Change in_Motivation id 


The repetition of an activity brings changes both in the cognitive 
structure and in the need-tension systems. Often a desirable outcome is 
a change in the interests or values of the learner, that is, a change in the 
relative attractiveness of one goal over another. This goal-attractiveness is 
what Lewin calls valence, and valences change. i 

The following three cases of valence-change as a result of repetition 


are to be distinguished: 

-1 Attractive goals may lose their attraction if the activity related 
to them is repeated to the point of satiation.)Monotony and boredom are 
familiar enough experiences, but their dynamics are by no means simple. 
The suggestion that every activity presents a barrier to its own repetition 
(one of Hull’s postulates ) does not suffice to account for the details that 
are found. For example, when tasks are graded as pleasant, unpleasant, or 
indifferent and then repeated excessively, the unpleasant tasks, as might 
be expected, are first satiated.” But it is the pleasant tasks which next 
become impossible to repeat, and the indifferent tasks are consistently 
last. Because of less ego involvement in the indifferent tasks, their endless 
repetition does not develop the resistances found for either pleasant or un- 
pleasant activities. Psychological satiation is not to be confused with 
fatigue, even though fatigue symptoms are prominently present. Karsten 
showed that the symptoms often disappeared with a change of set, even 
though the same muscular activities might be involved in the new task. 
For example, a subject who had to quit a line-drawing task because of 
inability to hold the pencil any longer was asked to write out a description 
of the psychological processes which had been involved in the perform- 
ance, and was able to pick up the pencil and write about the performance 
with ease, including demonstrations of the task which no longer could 
be performed! 

2. (Goals originally unattractive may become acceptable through a 
change in the meaning of the goal-related activity. There are many teach- 
ing techniques which depend upon this principle. Lewin gives a number 
of examples.*° One is the changing of food preferences by telling a story 
in which the disliked food is the favorite for the hero.) This is a common 
radio device for selling breakfast food. Another illustration is the changing 
of ideologies, as in the experiment of Bavelas (1942) in which relatively 


2° Karsten (1928). 
30 Lewin (1942), pages 935-236. 
81 Duncker (1938). 
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autocratic recreation leaders were changed into democratic ones. It was 
necessary for the cognitive structure of the field, “leadership behavior,” to 
undergo changes so that what they were trying to do as leaders became 
changed. 

3. The choice of goals is influenced by previous experiences of suc- 
cess and failure. This is most clearly shown in the experiments of Sears * 
in which she found that realistic goal-setting was characteristic of those 
who had experienced success. It is possible to attenuate the bad con- 
sequences of failure by teaching more appropriate adjustments to it, thus 
raising children who can tolerate frustration.** But this does not alter the 
fact that in addition to the problem of how goals affect behavior there 
exists the very important problem of how goals are chosen. 


The facts of amnesia show that forgetting and retention_ 
pletely understood according to our ordinary theories of forgettin 


are_incom- 


ting through 
disuse or retroactive inhibition. That there are motivational and emo- 


tional factors at work has been emphasized in the Freudian theory of 
repression.** It is in this context that Zeigarnik’s (1927) experiment on 
the recall of finished and unfinished tasks is to be considered, for the 
experiment dramatically relates forgetting to tension systems within the 
individual as a demonstration of Lewin’s (1926) theory. 

The subjects ( variously students, teachers, children, 256 in all) were 
given simple tasks to perform. In the individual experiments there were 
usually 18 to 22 such tasks: modeling a figure from clay, laying out playing 
blocks in certain forms, solving simple puzzles, and so on, The experi- 
menter interrupted the subject on half the tasks, waiting until the subject 
was deeply engrossed in it so that it would be a psychologically real in- 
terruption. Which tasks were interrupted differed from subject to subject 
according to conventional experimental practice, 

After “tidying up” in order to remove all tools and other reminders 
of specific tasks, the experimenter asked the subject: “Please tell me 
what the tasks were upon which you worked during this experiment.” 
Often a number of tasks were mentioned, then a pause would occur, After 


3? P, S. Sears (1940) (1941). 
33 Keister (1937). 
34 E.g., R. R. Sears (1936a), Rapaport (1942) (1950). 
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considerable effort a few more tasks might be recalled. For purposes of 
scoring, only those tasks were counted which were readily recalled, that 
is, before the pause. The difficulty of such recall is greater than might be 
supposed. In one series, 32 subjects recalled between 7 and 19 tasks out 
of 22 worked on, averaging 11, or just half the total; in another series, 
14 subjects recalled between 7 and 16 of 20 tasks, averaging 10, again half 
the total. 

When the relative success of recall for completed or interrupted tasks 
was examined, it was found that of the first 32 subjects, 26 remembered 
interrupted tasks better than completed ones, 3 remembered both equally 
well, and 3 remembered the completed tasks better. 

A scoring ratio was devised to indicate the relative superiority of in- 
terrupted tasks recalled (I) over completed tasks recalled (C). If both 
were equally recalled, the ratio would be I/C = 1.00. In the first experi- 
ment the ratio turned out to be 1.9, indicating that the interrupted tasks 
were recalled 90 per cent better than the completed ones.** Most of the 
other ratios obtained from groups were in the same direction, that is, 
above 1.0, although there were wide individual differences. 

Zeigarnik went on to test several hypotheses to account for the greater 
retention of the unfinished tasks, and also to determine the circumstances 
under which finished tasks would be better recalled. 

“the first hypothesis tested was that the attention value of the task 


might be enhanced by the shock of interruption. When, however, the task 


was interrupted early in the hour and completed later, it was forgotten 


like any other completed task, thus contradicting the hypothesis. 
A second hypothesis was that the subjects expected the interrupted 


tasks to be resumed and therefore made special note of them. However, a 


test situation in which one group was told that the tasks would be re- 
sumed, and another that they would not, failed to result in a difference 
between the groups. : 

The remaining hypothesis, the one accepted, is that a task not com- 


pleted leaves a state of tension, @ quasi-need. Completing the task means 


resolving the tension or discharging the guasi-need. The advantage in 
memory of the unfinished task is due to the continuation of the tension. 

The advantage of unfinished tasks is not to be universally expected. 
Zeigarnik pointed out at least eight conditions under which the relative 


advantage is lost: P : Bins F 
1. If a task, finished to the experimenter s satisfaction, is considered 


ointed out that the ratio of 1.9 is too high because it is an 


Mig Gl (1988) kias p a ratio of tasks recalled it is reduced to 1.6. 


average of ratios. Corrected as 
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inadequately completed by the subject, it may be recalled as well as an 
interrupted task. 

2. Tasks which are without a clear-cut completion stage, such as 
marking X’s on a sheet of paper, have no advantage in memory following 
interruption. Interruption is in such cases not a psychological incomplete- 
ness; it is the mere calling of a halt to an endless activity. 

8. Subjects who do not become at all involved in the individual tasks, 
but act according to command, remember finished and unfinished tasks 
equally well. A group of visiting school children who were set to work 
when they had come merely to see the laboratory were obedient in this 
way, and their recall ratio was I/C = 1.03. 

4. If the tasks are seen as leading to the satisfaction of a single need, 
so that the boundaries between them are weakened, the individual effects 
of interruption are lost. One small group of subjects became engrossed 
in discovering the meaning of the experiment, so that the ind 


ividual tasks 
were imbedded in this search for the secret of the experiment. The ratio 
was I/C = 1.12. Another group of subjects was shown in advance all 


the tasks that were to be solved in the hour. By reducing the separateness 
of tasks the difference between interru 
I/C = 0.97. 

5. If subjects were very tired when 
rested when tested, or rested while 
the memory advant 


ption and completion was lost, with 


performing the experiment and 
performing and tired when tested, 
age of unfinished tasks was lost; in fact, for the tired- 
fresh group, the finished tasks were better recalled (I/C = 0.61). An in- 
genious explanation is offered. It is said that the tension-system cannot 
be maintained in a loosened, relatively fluid medium, hence memory for 
the uncompleted tasks is poor in a tired condition. On the other hand, 


under such circumstances relatively stable systems (finished tasks ) 
sist in spite of the state of fatigue,** 


6. After delays of 24 hours, the I 
there is no longer an advantage for th 
as due to intervening activity, 
servation. 

7. If a highly emotional episode is introduced 
the retest, the same distortion occurs as with th 
subjects tested in this way all showed better reca! 


per- 


/C ratio drops to about 1.00, so that 
e interrupted task. This is interpreted 
not to time alone, because of the next ob- 


between the tasks and 
e passage of time} Six 
ll of finished tasks, 

36 The tests were made after 18-15 hours, when the relati: 


rupted task might have been lost in any case, If the con 
tion after performing while tired should favor the 


ve advantage of the int 

ri er- 
jecture holds, immedi 
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/ 8, A task which the subject found too difficult was often forgotten, 
even though it had been interrupted. The situation is considered to favor 
tepression.** 

The practical implications of Mrs. Zeigarnik’s findings are somewhat 
obscure. In the first place, the memory for the tasks performed is a very 
different matter from memory for what was learned or how to perform 
the task next time. It would be a mistake to assume that children should 
not be permitted to complete assignments, lest they forget them. They 
might forget what they did in the library on Tuesday afternoon and still 
remember what they found out there.* In the second place, the phe- 
nomenon is so temporary, and so easily disturbed by other experiences, 
that its practical meaning is uncertain. It may be that the temporary nature 
of the effects is due to the triviality of the tasks; if that is so, it is important 
demonstration made with more significant tasks. 

One of the more revealing of the later studies concerned with the 
problem of recall of the task is that of Marrow (1938). He explained that 
he was trying out different sorts of materials, and that if he saw the sub- 
ject was getting along all right he would stop him and give him a new 
task. Only if the subject was taking unusually long, or having trouble, 
would he be permitted to finish the task. Under these circumstances, inter- 
ruption was psychologica 
failure, Recall was found to b 


to have a comparable 


lly interpreted as success, non-interruption as 
e higher for the completed tasks, that is, 


those which caused feelings of failure. While wholly in agreement with 
the position that ego-related aspects of the performance are important in 
the recall, the tension set up is not quite that of task completion, although 
it may be related to it in the form: “If only I had another chance at it!” 
The type of investigation started by Zeigarnik has proved a stimulus 
to a great deal of future work.3® Some of the subsequent studies have been 


concerned with a slightly different problem—the resumption of the in- 
terrupted tasks, It is evident from these studies that correlated with the 


Memory for the task is a tendency to resume it when the opportunity next 


87 For a list of 11 variables affecting the results in experiments of this kind, see 
P; + 

rae aed } p) Pt found a tendency for the substance of an interrupted 
tes Cee ey ing) to be better retained than the substance of a completed task. 
Nuttin (1953), pages 341-351, also has shown memory for the substance of the un- 


finisl tention for the names of the tasks. 
mp Ae Prentice (1944), Glixman (1948), McGeoch and Irion 


(1952 1952). Representative studies include Abel (1938), 
Mog ibe? ee (1953), Glixman (1949), Martin (1940), Pachauri 
(1985), Rosenzweig (1938) (1941) (1943), Rosenzweig and Mason (1934), San- 
ford (1946). 
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arises.*° These studies are consonant with the Lewin interpretation that 
a quasi-need has been set up as a result of the interruption. 


ESTIMATE OF LEWIN’S POSITION 


Lewin on the Typical Problems of Learning 


Because his was not essentially a psychology of learning, it is difficult 
to assign Lewin a position on each of the standard problems. A large 
measure of conjecture necessarily enters. 

1. Capacity. The life space of an adult is more highly differentiated 
than that of a child. Similarly, the life space of an intelligent person is 
structured more highly than that of a less intelligent person. There 
also differences in fluidity or rigidity.*? 

2. Practice. Learning may take place with repetition because the 
change in cognitive structure or in motivation may require repetition. 
There is, however, no one-to-one relationship between number of trials 
and the changes which constitute learning. 

8. Motivation. The differences between reward and punishment are 
well represented in life-space diagrams, which point out the difference in 
the amount of policing required in the two situations. This aspect of re- 
ward and punishment has been neglected by all the other writers con- 
sidered. For some purposes it is preferable to speak 
failure rather than about reward and punishment. Th 
ego-involvement and level of aspiration become im 
structure is both activated by and changed by aroused needs or tensions, 
Motivation is therefore of central importance within the theory, 

4. Understanding. Because one of the chief characteriz; 
ing is change in cognitive structure, knowled 
at the heart of learning. 

“5. Transfer, Probably the 
to Lewin. 


6. Forgetting. Motivated forgetting is important. The dynamics are 
rather complicated, with interruption in an ego-involved task Je 
usually to better retention and to attempts at resumption wher 
tunity is offered. However, something like repression is accepted, in the 
forgetting of too difficult tasks, 


are 


about success and 
en the concepts of 
portant. Cognitive 


ations of learn- 
ge and understanding lie 


gestalt concept of transposition is acceptable 


ading 
n the oppor- 


40 Related studies include Adler and Kounin (1939), Cartwright (1942), Nowlis 
(1941), Ovsiankina (1928), Rethlingshafer (1941), Rickers-Ovsiankina (1987), 
41 Kounin (1941). 
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The Emphasis upon Motivation 

The relationship of motivation to learning has been increasingly 
recognized by stimulus-response theorists, but their usual solutions of 
the problem have not been well adapted to deal with the complexities of 
goal-setting, conflict, and problem-solving. The two common solutions 
have been that of the law of effect (and related reinforcement theories), 


and the addition of set and determining tendencies to ordinary associative 


processes, Lewin has made a real contribution by his more dynamic, in 


some real sense more “psychological,” approach to the problem. 

there were others making similar proposals. Lewin was 
undoubtedly influenced by Freud, whose work he thought brilliant but 
methodologically unsound.** In America, McDougall ** was the arch op- 
ponent of behaviorism until gestalt psychology came upon the scene. 
McDougall often inveighed against the errors of hedonism (in which he 
classed the law of effect) in placing too much emphasis on what happened 
in the presence of the goal, and too little on the satisfaction which derived 
from the striving. Woodworth, whose Dynamic psychology (1918) gave 
currency to the notion of drives, included the conception of activity itself 
becoming a drive. This is an anticipation of Lewin’s tension theory, al- 
though Lewin probably was not influenced by it. While McDougall and 
Woodworth were not gestalt psychologists, they saw some of the limita- 
tions in contemporary learning theory which Lewin also saw and at- 
tempted to meet through his experiments and theories. 

A satisfactory psychological theory of motivation is almost certain to 
include some kind of ego reference, that is, some incorporation of the 
present activity within the larger goals, values, or ambitions of the 
learner,‘# Lewin’s motivational theories definitely take account of such 


problems. 


Other Aspects of Lewin’s System 

The problems of Jearning can no longer be neatly segregated from the 
other problems of psychology and limited to the kinds of data that can be 
plotted in the form of learning and retention curves. Hence it becomes 
increasingly difficult to determine what aspects of a psychological system 
may become relevant for learning. In the foregoing account of Lewin’s 
work, emphasis has been placed upon those features that overlap, at least 


Historically 


HA Lewin (1937). 
3 J : 23). i 
E.g., McDougall (1923) d by the contributors to a symposium on the ego and 
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in topic, with what other learning theorists discuss. Here brief notice will 
be taken of certain other features of his system, more remote from the usual 
topics dealt with in the psychology of learning. 

One set of Lewin’s interests might be designated as the dynamics of 
child behavior. This led to the inclusion of chapters by him in successive 
editions of handbooks of child psychology *° and the collection of a book 
of readings on child development by authors much influenced by him.** 
With prominence given to childhood as a period of learning, 
child development becomes increasingly important for learnin 
gists. 

A second set of interests was social psychology, especially group dy- 
namics. An early disciple wrote a social psychology using many topological 
diagrams.*? A number of Lewin’s own Papers were collected in a post- 
humous volume (1951). Again, as the influence of the group upon the 
learning of the individual comes more into focus, studies of this kind may 
become more significant for learning, 


interest in 
g psycholo- 


Related to the interest in group dynamics was a methodological in- 
terest in action research, a technique for getting things done, while at the 
same time criticizing the process along the way. His papers in this field 
have also been collected and published in a small volume with the title 
Resolving social conflicts (1948). The applied psychology of learning can 
profit from the adaptation of some of the principles of action research, for 
example, in encouraging teachers to use newer methods and in g 
the support of the community for these methods,‘ 


Is Field Theory in Learning Related to Physical 
Field Theory? 


Critics of Lewin have objected to a certain gain for the Prestige of his 
theory because he used words such as “dynamic” and “field,” which 
with them implications that the theory is unusually up-to-date and modern, 
and that other theories are old-fashioned and stodgy. 

There are two major criticisms of his identifying his theory as a field 
theory: 

1. If he means simply that he reco 
theory is no more a field theory tha 
theory reco gnizes that behayior is a 
variables. 

45 Lewin (1933a) (1946). 
48 Barker, Kounin, and Wright (1943), 


47 J. F. Brown (1936). 
48 E.g., Hilgard (1953). 
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2. If he means that his theory is a field theory similar to a physical 
field theory, then his mathematics ought to correspond more nearly to 
that used by physicists. 

Lewin answered the first criticism by saying that “the recognition of a 
multitude of factors as determining an event, and even their representa- 
tion as a phase space, does not presuppose field theory.” *° He points out 
that Thurstone’s factor analysis fits the notion of multiple causation but 
is not a field theory. 

Lewin believed that his theory did correspond closely to field theory 
in physics. As it applies to psychology, he stated: 
atements of psychological field theory can be formulated 


r or any other change in a psychological field depends 
d at that time.*° 


One of the basic st 
as follows: Any behavio' 
only upon the psychological fiel 


He believed that the equivalent of the differential in physics (dx/dt) 
was his concept of “behavior,” which, for him, covered any change in the 
psychological field. But this is an inexact statement, for his equations do 
not actually make use of differentials. Hence the precise relationship be- 
tween psychological fields and physical fields is indeterminate. As Estes 
has put it: 
from our analysis of Lewin’s methodology is not that 


ad unsuitable in the treatment of learning but rather 
ecause Lewin did not construct a field theory.5+ 


The main conclusion 
field theory has been four 
that the issue is still open b 

His mathematical representations have been criticized, apart from 
their relationship to field physics. Lewin’s use of vectors and valences 
leading to locomotion in life space represents a courageous attempt to 
find a mathematics with a natural “fit” to the kind of data with which he 
deals, The rather unsatisfactory state of this mathematical structure has 
been pointed out not only by those impatient with his approach, but 
by those friendly to his point of view.®* But the difficulties are inherent in 
the early stages of theory construction and need not prevent appreciation 

in’: iti estions. 
of eae Leta, of the relationship Between mathematics and 
the data of a science is sound, There is no question that some forms of 
mathematics are more appropriate than others. The geometry of spaces 
does usefully represent alternative routes, conflict, ambivalence, barriers, 
and many other psychologically real situations. It is the incorporation of 


49 Lewin (1943), page 293. 
50 Lewin (1943), page 295. 
51 Estes (1954b), page 342- 
52 E.g., London (1944). 
SSE E, Leeper (1943). 
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metrical concepts of distance, direction, and the like which causes the 
most trouble. The usual diagram also best represents the momentary life 
space. The conversion of this diagram into a third dimensional figure 
showing the changes in time—with which learning theories are commonly 
concerned—introduces some awkwardness. As in the case of Hull's math- 
ematical model, a great deal of tinkering is possible in smoothing out 
the difficulties without destroying the essence of what it is trying to 
convey. 
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Chapter 9 


FREUD’S PSYCHODYNAMICS 


aa FREUD (1856-1939) so influenced psychological thinking 
that a summary of theoretical viewpoints, even in a specialized field 
such as the psychology of learning, is incomplete without reference to 


him. In the context of the history of psychology, Boring has this to say 
of Freud’s place: 


It was Freud who put the dynamic conception of psycholo 
chologists could see it and take it: They took it, slowly and with 
cepting some principles while rejecting many of the trimmings, 
that the history of psychology can be written in the next three ce 
mention of Freud’s name and still claim to be a general history of 


gy where psy- 
hesitation, ac- 
It is not likely 
nturies without 
psychology, 
It is no simple task to extract a theory of learning from Freud’s writ- 
ings, for while he was interested in individual development and the kind of 
re-education that goes on in psychotherapy, the problems whose answers 
he tried to formulate were not those with which theorists in the field of 
learning have been chiefly concerned. Psychoanalytic theory is too com- 
plex and, at least at the present time, too little formalized, for it to be 
presented as a set of Propositions subject to experimental testing. There- 
fore, instead of attempting an orderly exposition of a psychodynamic 
theory of learning, we shall rest content with examining in somewhat 
piecemeal fashion suggestions from psychoanalysis that bear upon learn- 
ing. Some of the suggestions have already influenced experimentation in 


learning, and some of them may be more influential in the years ahead, 


PARALLELS BETWEEN PSYCHOANALYTIC THEORY 
AND CONVENTIONAL INTERPRETATIONS OF 
LEARNING 
One can find in Freud’s writin 
very closely statements made by 
1 Boring (1950), page 707. 


gs a number of statements that parallel 
contemporary learning theorists little 
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influenced by him. The common topics within psychoanalysis and learning 
theory serve usefully to show how relevant psychoanalysis is to learning. 


The Pleasure Principle and the Law of Effect 


Hedonistic theories—that man seeks pleasure and avoids pain—are 
‘ons of human conduct, and any theory of 


among the oldest interpretati 
to which they refer. There is 


learning must come to grips with the facts 

no doubt that we can control learning by way of reward and punishment. 

The details sometimes elude us, but the gross facts cannot be doubted. 
Freud's pleasure principle is in accord with these facts, and his in- 


terpretation of the pleasure principle represents one of the first points 


of correspondence between his views and those of learning theorists. The 
corresponding principle in contemporary learning theory is more likely to 
go under the names of the law of effect or reinforcement theory. The broad 
th psychoanalysis and learning theory, is that a 
h Te adh, Gains we describe ai in terms 
n or of drives leading to consummatory 


responses, we are talking about the same sequence of events. What con- 
trols the direction of movement is a tendency to restore a kind of equilib- 
rium, thus reducing tension. Freud talked about a return to a constant 
state, and the physiologists, after Cannon, refer to homeostasis. The prin- 


ciple goes back at least to Claude Bernard, but in any case it is shared 


by Freud and the learning theorists whose views have already been con- 


sidered, 
One of Freud’s concise stat 


introductory paragraph of his Be: 

i hesitation in assuming that the 
In the theory of ps choanalysis we have no g 
course taken bi EAE is automatically regulated by the pleasure prin- 
ciple. We belies that is to say, that the course of those events is invariably set 
in motion by an unpleasurable tension, and that it takes a direction such that 
its final pi coincides with a lowering of that tension—that is, with an 
avoidance of unpleasure or à production of pleasure.” 


d about the relationship between pleasure and quan- 
he said, corresponded to an increase in 


tion, although he allowed some impor- 
tance to the rate of change of excitation. In this bo followed a lassie 
Psychologist, Fechner, who wrote along the same lines much earlier. 
Until lately, American psychologists have preferred te) gale about 
tension and its reduction without bringing in the term “pleasure.” More 


From Beyond the pleasure principle by Sigmund Freud, 
iblishers, New York. 


conception, common to bo 
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Y permission of Liveright Pu 
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recently, however, attention is again being paid to affectivity.* McClelland 
and his associates (1953) have proposed a quantitative theory of pleasant- 
ness and unpleasantness, somewhat along the lines suggested by Freud. 
They propose that a slight departure from the level of stimulation to which 
the organism is adapted is usually pleasurable, while an extreme departure 
is unpleasant or annoying. Thus if one adds salt to distilled water, small 
quantities make the taste more acceptable, but large quantities make it 
less acceptable. Freud suggested that this was a field open to experimenta- 
tion, and indeed it is, 

If we are to move to more sophisticated and refined theories of learn- 
ing, it is imperative that we pay attention to the differences between 
proposals as well as to their simila 
Freud's tension-reduction the 
law-of-effect discussions, This 
that remains unsatisfied prod 


satisfy the need. That is, tension reduction that is not accomplished 
through actu: 


al gratification is accomplished through wish fulfillment, in 
the form of i ject. e id level this 


, just as the thirsty man on the desert 
confuses the mirage with a true oasis, In the primary process, under the 


control of the pleasure principle, wish fulfillment is hallucinatory; the 
later, more realistic expectation of future events comes with the develop- 


ment of the secondary process, when memory images become substituted 
for hallucinations. 


aa 


The problem of hallucinations has not been entirel 


y neglected by 
learning theorists. Mowrer 


(1938) proposed the study of hallucinations as a 
means of investigating certain aspects of expectancy, Dollard and Miller, 
treating perception as a response, and a hallucination as a learned per- 
ception in the presence of minimal or ambiguous cues, accept the fact that 
hallucinations are more likely if drives are strong. They cite evidence 
for their position.® 

The first area of correspondence, then, between le: 
psychoanalysis, is the family resemblance between th 


wever, that learning 
4 E.g., Young (1952), 
5 Freud (1920b), page 2. . 
€ Dollard and Miller (1950), pages 178-181. The evidence is from McClelland 
and Atkinson (1948) that images are influenced 


i by drives, and from Holmberg (1950) 
that hallucinatory images in dreams are influenced by motivation, 
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theories have not fully incorporated the fantasy-production feature of the 
Freudian principle. 


The Reality Principle and Trial-and-Error Learning 


Freud’s first supplementation of the pleasure principle was by way of 
the reality principle, whereby the organism instead of insisting upon 
immediate gratification takes the “long indirect road to pleasure.” * Because 
the pleasure is found at the end of the path, we may think of the psychol- 
ogist’s rat, winding its way through a tortuous maze, eliminating the 
blind alleys along the way, as being under the control of a reality principle. 

There is some ambiguity here, for Freud’s reality principle is bound 
up with the developing ego, and it would be a big jump to assign the 
rat an ego because it avoids blind alleys on the way to food. 

Psychologists have been unclear in trying to find the correspondences 
between their principles and Freud’s reality principle. 

Dollard and Miller definitely relate the pleasure principle to the law 
of effect, but they have no direct counterpart to the reality principle. Ego 
strength (related, of course, to mastery of reality) they treat in the fa- 
miliar psychological terms of higher mental processes, learned drives and 
skills. Reality, they say, is elaborated according to the physical and social 
conditions of learning.* 

Mowrer takes a different position. He believes that the law of effect 
is related to the pleasure principle, in agreement with Miller and Dollard, 
but he would relate trial-and-error problem solution to this principle 
also, rather than to the reality principle. He believes the reality prin- 
ciple to be appropriate instead to simple Pavlovian (classical) condition- 
ing, because he believes Pavlov’s dogs learn “not what is pleasurable and 
relieving, but what is actual, true, real.” ° He goes on to infer that such 
conditioning is limited to the autonomic nervous system, and thus leads 
himself into what appears to be a most unusual position, namely, that 
reality learning is under the control of the autonomic system, whereas 
pleasure-pain learning is under the control of the central nervous system. 
To some it will seem more plausible to assign instinctive, impulsive func- 
tions to the autonomic system (though I would not go that far) and to 
assign reality-regulated functions, ego functions, to the cognitive appa- 
ratus: to the central nervous system and its related structures. 


7 Freud (1911). 
8 Dollard and Miller (1950), pages 9-10. 
® Mowrer (1950), pages 5-6. 
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Let us return to animal learning as possibly representing the reality 
principle because there are many events intervening between the aroused 
tension and its ultimate resolution. A white rat can learn the true path 
in a maze, and in a discrimination apparatus it can learn to jump toa 
triangle and not to a circle. Thus it learns permitted and unpermitted be- 
havior, much in the way a child learns to behave in accordance with the 
dictates of its culture as transmitted by those in authority over him. The 
rat can learn probabilities also. If rewarded more frequently on the right 
than on the left he learns to turn more often to the right, as though to 
equate the response-reinforcement ratio on both sides.?° This conforming 


of behavior to reality, even to a ee ai reality, surely has some resem- 
blance to the earliest ego functions, 


volves conflict, and the conflict is resol 
choices by reward and punishment, 
through the accumulation of experiences with the environment, Some- 
times, when discrimination is difficult, the rat mobilizes his experience 
through what Muenzinger and Tolman call “vicarious trial and error” 
(VTE), that is, a sampling of both choices by looking back or forth, or by 
making short runs first to one side and then to the other before the ani- 
mal is committed to a choice. This appears to be a kind of symbolic 
reality-testing, an analogue of ego-type behavior. f 
Academic psychologists and psychoanalysts spend a good de 
time studying the approach to and solution of reality problems, We may 
conclude that there is some gross correspondence in topic between what 
Freud calls the reality principle and the studies in which behavior is 
regulated largely by experienced success and failure. What is ] 
in the co-ordination is a clear ego psychology in academic 


be co-ordinated with the ego theory in psychoanalysis, The e 
of psychoanalysis is itself only n 


earned discrimination always in- 
ved through the weighting of the 
These weights become available 


al of their 


acking 
psychology to 
80 psychology 
ow undergoing clarification, 


Repetition-Compulsion in Relation to Theories 
of Habit Strength 


In his Beyond the pleasure principle, Freud introduced a nu 
highly speculative notions, with awareness that he was tr 
to which he did not wish fully to commit himself, 
most closely bound to evidence from his clinical 


mber of 
ying out ideas 
One of the Suggestions 
experience was that of 


10 E.g., Brunswik (1939), 


41 For developments within psychoanalytic ego-theory, see Erikson 1950 
(1936), H. Hartmann (1950), Loewenstein (1954). ( ), A. Freud 
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a repetition-compulsion. These repetition-compulsions are believed to be 
beyond the pleasure principle, for Freud says: 

The greater part of what is re-experienced under the compulsion to repeat 
must cause the ego unpleasure, since it brings to light activities of repressed in- 
stinctual impulses. . . . The compulsion to repeat also recalls from the past 


experiences which include no possibility of pleasure, and which can never, even 
long ago, have brought satisfaction even to instinctual impulses which have 


since been repressed.'* 


Freud recognized that sometimes repeating unpleasant activities_— 


yields pleasure, possibly through the sense of mastery that is achieved. 


These cases he believed to be genuine, but they are still under the 


control of the pleasure principle, and are not what Freud meant by acts 


not under the control of the pleasure principle. He was not talking about 
secondary or derived pleasure; he referred to behavior that he believed to 
be independent of, and disregarding, the pleasure principle. 

In the effort to understand these repetition-compulsions, especially 
as they appeared in traumatic neuroses, Freud was led to posit a destruc- 
tive instinct (“death instinct”) existing alongside of Eros (the “life in- 
stinct,” comprising self-preservation, self-love, and object-love). This is 
one of the more controversial topics within psychoanalysis itself, and some 
relatively orthodox analysts find Freud’s formulation unsatisfactory. 

This is not the place to enter into a detailed critique of psychoanalytic 
theory, and in relating the work of academic psychologists to Freudian 
theory we must be satisfied to review some of the points of correspond- 
ence, In general, academic psychologists who have discussed repetition- 
compulsion have confined themselves to the kinds of behavior resistant to 
the usual principles of extinction. Mowrer, for example, notes the parallel 
between Hulls interest in acts that do not extinguish and the repetition- 
compulsion, and he sees a relation between repetition-compulsion and 
functional autonomy.” Dollard and Miller indicate that the repetition- 
compulsion is a result of cues provided by emotions that are themselves 
aroused by meaningless shreds of memories that recur in free association. 
Instrumental acts (the tendencies to repeat) are conditioned to these 


cues,?® 
Learning theorists have offe 
usually resistant to extinction: 


12 Freud (1920b), page 21- From Beyond the pleasure principle by Sigmund Freud, 
by permission of cae Publishes New York. 

18 E.g., Fenichel ( 1945), pages 5 i 

14 Mowrer (1950), pages 9, 122 253. 

15 Dollard and Miller (1950), page “°° 


red three suggestions to explain acts un- 
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1. All compulsions may conform to ordinary principles of tension- 
reduction learning. They may involve subtle forms of anxiety reduction, 
as in the defense mechanism known as “undoing.” It is admitted that 
available data are insufficient to demonstrate that this explanation holds 
in all cases.1* 

2. Overlearned activities may be resistant to change, hence lead to 
excessive repetition, Rats trained to jump a gap in order to be rewarded, 


if they repeat the activity often enough, will fail to take a direc 


t path 
when one is provided.” A kind of “canaliz 


ation” takes place, so that the 


repetition-compulsion. 

3. Behavior acquired under exc 
mally fixated,” and thus resist 
by Maier (1949) on the basis 


now on the 
he windows 
reotyped be- 
. The response 


between the 
ated response, Even when 
od clearly in view, the rat, 
he left, bump its nose, and 
o an experimenta] analogue 


The experimental problem has been r 
who repeated Maier’s experiments with a 
the main cause of the excessive fixation to 
that was occasionally punished, occasional 


16 Mowrer (1950), page 112. 

17 Gilhousen (1931). ; 

18 The possible significance of ‘ 
pages 161-191, 


eopened by Wilcoxon (1952), 
number of controls, He found 
be the the learning of an act 
ly rewarded, when the alter- 


canalization” has been treated by Murphy (1947) 
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native was invariably punished. While he interpreted his results as con- 
forming to ordinary principles of associative learning, the causal factors 
are complex. 

Compulsive behavior is a profitable field for further investigation, 
apart from trying to test in detail Freud’s hypotheses. Fenichel lists three 
types of compulsion.? The first type rests on the periodicity of drive 
states (instincts), so that derivatives of these states may be expected also 
to occur periodically. He believes that manic-depressive phenomena fit 
this type. The second type is a repetition due to the tendency of the re- 
pressed to find an outlet. In the so-called “neuroses of destiny” the patient 
periodically evokes or endures the same experience. The symptoms are 
sometimes set off by “anniversaries” of significant events.*? Compulsive 
rituals, such as compulsive counting, are said by Fenichel to fit this type. 
The third type consists in the repetition of traumatic events for the pur- 
pose of achieving a belated mastery. Children who have been the passive 
victims of frustration may seek in their games an active repetition of the 
event at a time of their own choosing, in order to gain comprehension and 
mastery, The dreams and repeated symptoms in traumatic neurosis fit 
this pattern. 

It is evident that there are problems here that have not been fully 
represented in learning experiments, although again there is overlap in 
topic, that it, between repetition-compulsion and overlearning, abnormal 


fixation, and resistance to extinction. 


vases 1 (the pleasure principle, the reality 
These three principles from Freud Pleasure pr : y 


principle, and yepetition-compulsion ) have een selected to show that 
theréare at least some parallels between Freudian-theory and the dis- 

eorists. It would no doubt prove fruitful to try to 
with respect to these parallels, for the interrela- 
obvious and direct. 


cussions by learning th 
achieve greater clarity 
tionships are by no means 
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19 Fenichel (1945), pages 542-548. 
20 E.g., J. R. Hilgard (1953). 
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Freudian theory, and Freudian theory was not developed in order to 
“explain” the results obtained by experimental psychologists. There have 
been a number of topics within Freudian theory, however, that have 


definitely set problems for the experimental psychologist. We now turn 
to a review of some of these. 


Anxiety as a Drive 


Freud's views about anxiety evolved along with other aspects of his 
theory. At first he thought that attacks of anxiety in his patients gave 
evidence of repressed instinctive (libidinal) excitation. That is, repressed 
libido, he thought, was transformed into anxiety.** Later, as he developed 
the theory of id, ego, and superego, he assigned anxiety production to the 
ego. When the ego perceives danger this perception arouses anxiety, and 
then steps are taken to reduce anxiety. Symptom formation may be one 
of these steps. Repression may also be one of the devices adopted by the 
ego, so that, instead of repression causing anxiety, we now have anxiety 
causing repression.?* 

Anxiety is closely related to fear, though fear usually has a real ob- 
ject, while anxiety may be a vague apprehension of unknown 
Three kinds of anxiety can be distinguished, of which the first is i 
guishable from fear: 

1. Objective anxiety (also called real or true anxiety) depends upon 
real or anticipated danger whose source lies in the external world, To be 
afraid of a poisonous snake or a holdup man is to be anxious in this way. 
True anxiety implies a real known danger. 

2. Neurotic anxiety is in regard to an unknown danger. Upon analysis 
it is found that the danger is, as Freud put it, an instinctual one. That is, 
a person is afraid of being overpowered by some impulse or thought that 
will prove harmful to him. Sometimes there is a real or threatened danger 
but the reaction to it is excessive, thus revealing the neurotic element in 
the anxiety.** 

8. Moral anxiety is aroused by a perception of danger from the con- 
science (superego). The fear is that of being punished (belittled, de- 
graded) for doing or thinking something that is contrary to the ego ideal, 
Moral anxiety is experienced as feelings of guilt or shame." 


*1 A useful account of the changes in Freud’s conceptions is given by R, May (1950) 
pages 112-127. : 
22 Freud (1920a), pages 347-349, 
23 The new views can be found in Freud (1926). 
24 For the distinction between these first two types of anxiety, see Freud (1926) 
pages 112-117. > 
25 For a discussion of the three kinds of anxiet > See Hall (1954), 
shame and guilt, see Piers and Singer (1953). 7 ( ), pages 59-69, On 


danger. 
ndistin- 
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Anxiety in later life was said to have two modes of origin: one, in- 
voluntary and automatic, whenever a danger situation arose, the second 
“produced by the ego when such a situation merely threatened, in order 


to procure its avoidance.” ** 


Mowrer (1939) was the first of the experimenters on learning to see 
the possibility of studying an analogue of anxiety in experiments on avoid- 
ance conditioning. He proceeded to show experimentally how rats that 
had experienced pain in the presence of certain cues could be led to make 
avoidance responses in the presence of those cues, that is, when the pain 
was merely threatened.** Thus he believed the anxiety-fear response to be 
the conditioned form of the pain response. Just as pain can be used as a 
drive to produce learning through escape, so anxiety can. be used to pro- 
duce learning. The anxiety can be reduced by appropriate avoidance re- 
sponses just as pain can be reduced by escape responses. Hence these 
anxiety-reducing responses are learned by the ordinary reinforcement 
principle of tension reduction. Conceivably defense reactions or neurotic 


symptoms might be acquired through this familiar learning pattern, This 
or anxiety has been extensively studied as a drive 


kind of acquired fear 


in animal learning, especially by N. E. Miller.” 
Mowrer’s views changed somewhat as he began to consider those 


features of human behavior less represented in rat behavior, especially 
man’s capacity to consider the future consequences of his acta He gradu- 
ally arrived at his own position on anxiety, at — with Freudian 
theory. He developed what he called a “guilt theory” as contrasted with 


Freud’s “i Ise theory.” *° 
The ma ae of anxiety, shame, and guilt lead far afield from present- 


day learning theory. The main point of contact at present is in the treat- 
ment of a fear as a drive. The importance of fear as a drive is thus 


summarized by Dollard and Miller: 
i r i f all is fear, or “anxiety” as it is often 
t important drives o € y 
Salle, as, of a S Mor is Fee or obscured by repression (Freud, 1926). There 
are Ate: its Santor why fear is so important: because it can be so strong, 
Sie aed to new cues so easily by learning, and because 10I 
the AON SLE that produces the inhibiting responses in most conflicts.*! 
SAK somewhat different, yet related, use of the concept of anxiety in 


learning experiments was introduced by Taylor (1951), when she classi- 


26 Freud (1926), page 109. 

a Mae ee i a review of other related studies see Miller (1951b), Farber 
(1954). r 

29 Mowrer and Ullman (a ). 

30 Mowrer (1950), page 531. 

51 Dollard and Miller (1950), page 190. 
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fied human subjects in an investigation of eyelid conditioning according 
to their scores on a verbal “anxiety scale” derived from the Minnesota 
Multiphasic Personality Inventory. She found that the more anxious sub- 
jects conditioned more rapidly than the less anxious ones, and interpreted 
anxiety as drive—as the multiplier D in Hull’s theory. A number of ad- 
ditional experimenters have used her scale in testing various hypotheses 
about the relationship between anxiety and conditioning.** 

Our review shows us that in the course of experimentation theories 
of anxiety have diverged from the testing of Freud’s interpretations. But 
the original introduction of anxiety concepts into learning theory was by 


those who were interested in Freudian theory, and they indicated in- 
debtedness to Freud’s writings. 


Unconscious Influences upon Word Associations 


The free-association technique of psychoanalysis has found its repre- 
sentation in psychological experimentation chiefly through the diagnostic 
word-association test introduced by Jung. Many experiments have been 
done, acknowledging the importance of the emotional processes behind 
word associations, leading quantitatively to the recognition of a number 
of “complex indicators,” * Although these have not had much influence 
within learning experiments, lawful relations can be found, as between 
the frequency of response words and the reaction times with which they 
are given,** 

Word recognition, which is also a form of associative reaction to the 
printed word, has been studied in relation to 
of the words. These studies, often referred 
defense, are reviewed later in this chapter. 


the affective significance 
to as studies of perceptual 


Repression, Forgetting, and Recall 


Ever since Ebbinghaus, psychologists have made quantitative study 
of memorization and recall part of their laboratory practice. It was natu- 
ral, therefore, that Freudian interpretations of memory lapses should have 
been seized upon as appropriate for experimental testing. At first, psy- 
chologists grasped only the forgetting aspect of repression, failing to take 
into account the motivational aspects. This was due, in part, to the re- 


82 Bindra, Paterson, and Strzelecki (1955), Bitterman and Holtzmann (1952), Far- 
ber and Spence (1953), Hilgard, Jones, and Kaplan (1951), Montague (1958), Ra- 
mond (1953), Spence and Farber (1953) (1954), Spence and Taylor (1951) (1953 ) 
Taylor and Spence (1952). For a valuable review of both experiment and theory Seo 
Farber (1954). The scale itself is described in Taylor (1958). d 

33 E.g., Hull and Lugoff (1921). 

34 Seblocberg and Heineman (1950). 
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jection of the psychoanalytic theory of instincts, in part to the absence of 
motivational concepts in prevailing studies of memory and forgetting. 

Freud distinguished between a primal repression and a second phase 
called repression proper.** The primal repression consists in “a denial of 
entry into consciousness to the mental (ideational) presentation of the 
instinct.” The ideational content then remains unaltered and the instinct 
remains attached to it. “The second phase of repression, repression proper, 
concerns mental derivatives of the repressed instinct-presentation, or such 
as, originating elsewhere, have come into associative con- 


trains of thought 
to ward off these secondary derivatives 


nection with it.” The tendency 
is sometimes called after-expulsion. 

It is to be noted that repression proper assumes a primal repression, 
so that the pattern for all later repressions is set up early in life. Later 
events may be assimilated to these earlier ones. The activity of repression 


is not, however, over at the time something is repressed. 


s not to be regarded as something which takes 
s of which are permanent, as when some living 
t time onward is dead; on the contrary, re- 
diture of energy, and if this were discon- 
would be jeopardized, so that a fresh act 


The process of repression i 
place once and for all, the result 
thing has been killed and from tha 
pression demands a constant expen 
tinued the success of the repression 
of repression would be necessary.*? 


Sears has summarized the characteristics of a repressed instinctual 


i i ix . 38 
impulse, according to Freud, in six statements: 


true form in consciousness. 


1. It is not represented in its r € 
p evelops in a more luxuriant fashion than it 


2. The instinct-presentation d 


would if it were conscious. i p “ia i: 
3. The resistance of consciousness against derivatives and associations of 


the instinct-presentation varies in inverse proportion to their remoteness from 


the ideas origina essed. 
pa i is highly specific to each idea and substitute idea. 


5. Repression is very MODHE. f B 
6. The degree of repression varies with the strength of the instinctual im- 
pulse. 


The feature of repression that has appealed most to psychologists is 


its relation to pleasure and pain. Freud says, for example, that in repres- 
sion the avoidance of pain must have acquired more strength than the 
pleasure of gratification.” Psychologists have often interpreted repression 


35 These points were made by Sears (1936a) in an abile devte of Brends Mengde 
repression and the possibilities within experimentation. 

#8 Freud (1915a), page 86. 

aF, f a 9. 

3s A (1936). ages 949-243, based on Freud (1915a). 

39 Freud (1915). page 85. The line of development began much earlier, 


302 THEORIES OF LEARNING 


quite superficially to mean that pleasurable events will be remembered 
and distasteful ones forgotten. The first experiments used merely affec- 
tively pleasant or unpleasant materials, such as pleasant and unpleasant 
odors, words such as “sugar” (pleasant) and “quinine” (unpleasant), 
studying whether or not such materials could be easily memorized or 
retained.*° Later studies have sought in one way or another to meet more 
squarely the demands of psychoanalytic theory. We shall return later to a 
consideration of some of them. 


Fixation 


Fixation has two closely related meanings in psycho 
The first meaning is that of arrested development, so th 
be fixated at an infantile or adolescent level of 
This usually implies an object-choice (the moth 
propriate to the level of fixation, and the statement is then made th 
the person is fixated at such-and-such level and fixated upon such- 
such an object. The second meaning is that of fixed habits le 
ferred modes of solving personal problems, such as a fixation upon a 
particular mechanism of defense, The habit fixations discussed as forms 
of compulsion (pages 294-297) are fixations of this second kind. The two 
meanings are not clearly distinguished in psychoanalytic theory because 
the stage of arrested development is so intimately related to the style of 
life, That is, habitual modes of reacting as an adult are likely to be de- 
scribed in developmental terms, for example, the “anal character,” 


analytic theory. 


and- 
ading to pre- 


the 


arlier stage makes entrance upon the 
arrested development may oc- 


40 E.g., Gordon (1905) (1925), Tait (1913), Tolman (1917). 
41 Fenichel (1945), page 523, writes that Neurotics “are 
levels of instinctive demands but also to certain mechanism: 
cludes “character attitudes” among the items to which they 

42 Sears (1943), pages 81-89. 


not only fixated to certain 
s of defense.” He also in- 
are fixated, 
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cur. Habits at the different levels are not all-or-nothing affairs, for an 
adult who is grown up in some habits may be infantile in others. Ultimately 
We require a quantitative theory to account for arrested development in 
some aspects of personality and progressive development in others. 


Regression 

Regression is related to fixation in that when an act is blocked or 
frustrated, some substitute will occur. The substitute is quite likely to be 
an act once strongly fixated in the individual’s repertory. Such a substitu- 
tion of an earlier habit for a contemporary one represents one kind of 
regression. 


We may distinguish three kinds of regression: 
1. Instrumental-act regression. When the organism is prevented from 


using one habit, an earlier learned habit is substituted. Many animal ex- 
periments have demonstrated this to occur.** These results provide an 
analogy to object-regression in Freudian theory, the kind of regression in 
which gratification is gained by relinquishing a present object and return- 
ing to an earlier one. 

2. Age-regression. Under some circumstances the perscn returns to 
earlier modes of behavior and obliterates future events, in a kind of re- 
vivification.“ If, for example, an adult who regressed under hypnosis to 
an early age began to speak in a language used in childhood but no longer 
available to him in his adult life, we could say that he had regressed 
(at least in this respect) to an earlier period in his own life. Experiments 
under hypnosis have led to somewhat ambiguous results. Occasionally 
the functioning at the suggested (regressed ) age resembles very closely 
the known historical circumstances in the person’s life.** When regressed 
Subjects take intelligence and personality tests, their performances differ 
from those of actual children at the age levels represented by the sug- 
gested regression. Hence there appears to be a certain amount of role- 

layin i 46 
P- a aA Even though the regressed individual may not return 
either to instrumental acts once in his inventory of habits or to personality 
functioning characteristic of himself at some previous period, he may, 
under stress, show a kind of behavioral disorganization that can be char- 

43 E.g., Hamilton and Krechevsky (1933), Mowrer (1940a), O'Kelly (1940a) 


(194 a 89-96, has reviewed this literature. i ; 
‘A The cous nE Weitzenhoffer (1953), page 191, as one classification of 


Tegressi ifestati der hypnosis. 
8ressive manifestations un yP kupfer (1945). 


45 Fri ill (1948 si e " 
46 rp teed ee On intelligence tests, see Kline ( 1950), Sarbin (1950). 
For studies with personality tests, see Sarbin and Farberow (1952), Orne (1951). 
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acterized as more primitive. This kind of regression has been studied with 
young children subjected to mild frustration.“ They showed regression 
by such indices as reduced constructiveness in play following the ex- 
perienced frustration. 

Psychoanalysts suggest that some psychotic manifestations are regres- 
sive, particularly schizophrenia. DuBois and Forbes (1934) found little 
return to the fetal posture during sleep. Cameron (1938a) ( 1938b) failed 
to find childish thought patterns in schizophrenic speech. Sears points 
out, however, that these studies are inadequate tests of regression because 


they do not bear adequately on the possible affective indicators of regres- 
sion.*® 


Aggression and Its Displacement 


In his later years Freud becam 
hostility, and destructiveness, In disc: 
group antagonisms he said: 


e increasingly aware of aggression, 
ussing racial tensions and other inter- 


We do not know why such sensitiveness should have been directed to 
just these details of differentiation; but it js unmistakable that in this whole 
connection men give evidence of a readiness for hatred, an aggressiveness, the 
source of which is unknown, and to which one is tempted to ascribe an ele- 


mentary character.*® 

Freud recognized the importance of giving expression to hostility, 
rather than holding it back, and many contemporary analysts now give 
prominence to the role of repressed hostile impulses in neurosis, 

While Freud thus paid increasing attention to 
later writings, as something inherent in man, he had earlier suggested 
that interference with instinctual satisfaction leads to a hostile attack 
upon the source of the frustration, Symptoms of neurotic illness may be 
so directed as to cause distress to someone in the environment who is 
perceived as the agent of frustration, Building upon these sugges 
Freud (1917), Dollard and his collaborators formul 
aggression hypothesis in a manner to make possible 
The general principle stated was that frustration leads to aggressive ac- 
tion; it was necessary later to correct the implication that aggression was 
the only (or even an inevitable) consequence."! The experimental eyi- 


aggressiveness in his 


tions in 
ated the frustration- 
quantitative testing.‘ 


47 Barker, Dembo, and Lewin (1941). 

48 Sears (1944), page 315. 

49 Freud (1921), page 56. This “elementary character” he had assigned a year before 
to a “death instinct.” 

5° Dollard, Doob, Miller, Mowrer, Sears, Ford, Hovland and Sollenberger (1949). 

51 Miller (1941), Sears (1941). These two papers are part of a symposium on the 
effects of frustration, The other papers are Rosenzweig (1941), Bateson (1941), Levy 
(1941), G. W. Hartmann (1941 ), and Maslow (1941), 
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dence gave abundant support to the hypothesized linkage between ex- 
perienced frustration and subsequent aggressive, hostile, or destructive 
behavior. 

If the agent responsible for frustration is unknown or inaccessible 
as an object of attack (absent or protected through conflicting response 
tendencies), another object will be chosen. This is called displaced ag- 
gression. The object may be as innocent as the scapegoat which in the 
ancient Hebrew ritual was made to bear the nation’s sins.** The word 
“scapegoat” has come into general use to refer to a victim of displaced 
aggression. 

Neal Miller and his collaborators have succeeded in relating dis- 
placement to stimulus generalization in learning theory." In his original 
experiment, Miller taught rats to strike each other, as in fighting, when 
an electric shock came on. When a second rat was not present, a rat would 
“displace” the aggression to a celluloid doll or other object in the environ- 
ment. The gain to be expected from Miller's experimental arrangement 
is that certain quantitative predictions can be made about the occasions 
for displacement, the intensity of the reaction, and so on. Miller formu- 
lated eight specific deductions from his theory, of which the following 


three are illustrative: 


When the direct response to the original stimulus is prevented by the 
displaced responses will occur to other similar stimuli 


absence of that stimulus, $ 
ha ll occur to the most similar stimulus 


and the strongest displaced response wi 
present, 

When the direct response to t 
the strongest displaced response wi 


degree of similarity to the original one. : 
er If the pee: A of the drive motivating the direct response to the original 


stimulus is increased, it will be possible for increasingly dissimilar stimuli to 
elicit displaced responses.” 
While Miller’s experiments on displacement have been with animal 


subjects, he shows how the principles suggested may apply to human 
s, discriminations tend to take the form of words, 


so that we narrow our meanings and reduce our generalizations. The 
reverse holds true also, and we use words to generalize, to name classes 
of objects that belong together. At this point, however, we are interested 
in discrimination and how it breaks down. Suppose that Mr. Brown dis- 
tinguishes among his new neighbors by calling them all by name. Because 
of a painful snub, he becomes very angry at Mr. Jones, one:of these neigh- 


the scapegoat is in Leviticus 16:22. 
aeling (1952), Miller and Murray (1952), Murray 


the original stimulus is prevented by conflict, 
ll occur to stimuli which have an intermediate 


behavior. In human being 


52 The Old Testament account of 
53 Miller (1948b), Miller and Kr 


and Mill 52). r 
54 oa a a are quoted from Miller (1948b), pages 168, 170. 
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bors. Because Mr. Jones's social position is more secure than Mr. Brown’s, 
and Mr. Brown is new to the neighborhood, Mr. Brown may repress his 
anger. (This fits the suggestion that the expression of hostility may be 
prevented by conflicting tendencies.) In Miller's interpretation, what he 
represses is the sentence: “I am angry at Mr. Jones.” Because this sentence 
is repressed, the anger is no longer tied directly to Mr. Jones, and it is 
more likely to generalize to another neighbor because discrimination 
through names has broken down. 

Miller recognizes that the displacement problem is actually more com- 
plex than this. He points out that the name of Mr. Jones is not repressed 
when it is not in a sentence referring to anger. So Mr. Brown recognizes 
the neighbor’s dog and says to himself: “This is Mr. Jones’s dog.” Because 
the name of Mr. Jones is attached unconsciously to hostile tendencies, he 
may give the dog a swift kick, which he would not do if he recalled that 
he was really angry at Mr. Jones, not at his dog. Thus Miller has deduced 
some functional differences between consciously and unconsciously de- 
termined behavior, He has restated his deduction succinctly: 


To summarize, the repression of verbal responses specifying the source of 
the aggression may remove a basis for discrimination and allow the illogical 
generalization, or displacement, of that aggression to be mediated by a different 


verbal response which is not repressed,°° 

Some closely related observations arise out of the study of doll play 
by young children. When attention is paid to aggressive behavior (fantasy 
aggression, in this case), the choice of object follows some of the principles 
of conflict outlined by Miller. That is, where anxiety is high, the object 
of aggression is chosen as one less similar to the person who is the source 
of the aggression than when anxiety is low. Consequently, as children play 
repeatedly in a permissive environment, their choice of object gradually 
shifts toward the parents, who have commonly been the authority figures 
responsible for some of their frustration.* 

We may conclude this section by pointing out that on these topics 
(anxiety, affectivity and verbal responses, repression and recall, fixation 
and regression, aggression and displacement) the theories of Freud pro- 
vided a stimulus to research. At the empirical level this has led to a fruit- 
ful interaction between psychoanalysis and experimental psychology, 


55 Miller (1948b), page 176. 


56 P, S. Sears (1951), R. R. Sears (1951), R. R. Sears, Whiting, 


Nowli: 
P. S. Sears (1953). owlis, and 
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SUGGESTIONS FROM PSYCHOANALYSIS LITTLE 
REPRESENTED IN PYCHOLOGICAL STUDIES 
OF LEARNING 


One has the feeling that a good deal of rich material from psycho- 
analysis has eluded the academic psychologist. This, if true, might be due 
either to a certain obtuseness on his part, or because the richer material 
is too difficult to bring under experimental control. The problem has been 
discussed repeatedly. Thus Sears at one time came to the conclusion: 


It seems doubtful whether the sheer testing of psychoanalytic theory is an 
appropriate task for experimental psychology. Its general method is estimable 
but its available techniques are clumsy.“ 
ul discussion of techniques of verification of 
felt that a limitation of the laboratory was 
e the real dangers or deal with the basic 


Kris, in a very thoughtf 
psychoanalytic propositions, 
that the laboratory cannot produc 
needs which the genetic propositions of psychoanalysis encompass: 
laboratory to quasi needs (and quasi dangers) seri- 
ously restrict the area of propositions that can be experimentally verified. In 
fact, up to the present, experimental approaches have been more successful in 
dealing with propositions concerning substitution 58 than they have been with 
propositions concerning repression.” 

Kubie, while eager to see research on psychoanalysis move forward, 
and even outlining a research institute in psychoanalytic psychology, de- 
plores a certain triviality in many of the psychologists’ experiments on 


psychoanalytic phenomena. 


The limitations of the 1 


It is important that any experimentalist should first make himself thoroughly 


familiar wi a as these occur in nature, ascertaining what can be 
ar with phenomena 4 pie = 
ap and ear before deciding what to subject to ex- 


roved wi naided eye a4 
E RE F z Experimental facilities should not be wasted on 
. roved, and to which human bias alone con- 


issues which are already clearly pro 
tinues A blind fi "The experimentalist should rather take up where the natural- 


ist leaves off.°° 
Related to Stages of Development 

Writers on psychoanalysis often stress that it isa genetic as wallvasta 
dynamic theory." That is, continuities in the life of the individual must 
always be taken into account along with what is happening in the present. 


Learning as 


57 R. R. Sears (1944), page 329. The antecedent of “its” is experimental psychology, 


58 Henle (1942b) is referred to here by Kris. 
59 Kris (1947), page 255. 

°° Kubie (1952), pages 64-65. _ 

61 H. Hartmann and Kris (1945). 
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At many points the theory suggests that the very young child is unusually 
susceptible to influences which leave a permanent mark upon his per- 
sonality, so that, for example, dreams throughout his life may be influenced 
by these earliest learnings. If this interpretation is true, it is important 
for a general psychology of learning, and the evidence needs to be 
specified. . 

While the problems have not been formulated clearly in terms of 
learning (i.e., why and in what way the results of childhood learning are 
more permanent than later learning) we are beginning to get kinds of 
evidence supporting the gross facts of adult consequences of early child- 
hood experiences. 

Among the experiments with animals, the best known are the ex- 
periments of Hunt (1941) on the effects of infant feeding frustration on 
adult hoarding in rats, and of Wolf (1943) on the effects of sensory depri- 
vation in early life upon types of functioning under stress in later life, 
again with rats. While the results of these experiments are subject to some 
reservations, they are relevant to the genetic theory that early 

a show their influences in the conflicts of later life. 
‘Some conjectures about child-training practices and personality in 
adult life have been put to the test by Whiting and Child (1958). They 
went to the cross-cultural files, where anthropological reports on upwards 
of 200 cultures are summarized according to a gre 
Then they asked certain questions about childhoo: 
veloped hypotheses about their consequences for 
those cultures. To take one illustration, they 
anxiety established in early childhood should a 
the causes of illness in adult life. Even in our 
phasis upon the germ theory of disease, we have many subordinate in- 
terpretations of illness: “It must have been something I ate,” “Tve heen 
working too hard,” “You have to suffer for your sins.” Our remedies, too, 
show something of the magical quality: indiscriminate Prescriptions of 
cure by rest, by liquids, by bland foods, by cathartics, by play, 
tanning, by religious observances. If a “scientific” culture such as ours has 
all these magical residues, it is not too much to expect that primitive inter- 
pretations of illness might be related to deep-seated anxieties, 

Whiting and Child specified five kinds of anxie 
criteria for rating these. These five were: oral, anal, s 
and aggression. The assumption is that severity of tr 
of punishment for infraction in these areas, or f 
would lead to anxiety. A culture might be sever 


experiences 


at many categories, 
d practices and de- 
adult experiences in 
predict that socialization 
fect the interpretation of 
own culture, with the em- 


by sun- 


ty and developed 
exual, dependence, 
aining and severity 
ailures of gratification, 
e in one of these areas 
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and permissive in the others. In addition, Whiting and Child classified 
primitive interpretations of illness as derivatives of these anxieties. 

When the childhood experience was correlated with adult interpreta- 
tions of illness, some striking correspondences turned up. For example, 46 
cultures could be classified with respect to degree of oral anxiety in child- 
hood, and interpretations of illness with oral aspects, such as food poison- 
ing. Of 20 cultures high in producing oral anxiety in children, 17 used oral 
interpretations of illness. Correspondingly, of 19 cultures low in oral 
anxiety, only 6 used oral interpretations of illness. This kind of evidence 
does not prove a relationship between childhood anxiety and adult be- 
havior, but it makes such a relationship plausible. 

Another important suggestion from psychoanalysis is that of child- 
hood amnesia. In later life we appear unable to recall events that must 
surely have impressed us at the time, and about which we must have 
talked. Because these events become secondarily linked with repression 
sequences, they too are repressed. Memories that are recalled are often 
trivial, and perhaps distorted, the so-called “screen-memories,” probably 
serving some of the purposes of repression. This whole field has been 
touched hardly at all by academic psychologists, who ought to be able 
to explore it with children, and not have to depend upon the retrospective 
accounts of adult patients.” 

A further aspect of the developmental sequence posited by psycho- 
analysis, and relevant for learning, is the latency period, for it has been 


related to readiness for schooling (at about the age of 6) when the oedipal 
solved and the ego is ready to feast its curiosity 


problems are temporarily 
ter a few years adolescence produces a new 


about the external world. Af 


threat, and learning may again be disrupted. 
Most of the attempts to study the latency period by non-psycho- 
d serious doubts about the existence of such a period. 
the evidence up to 1952, reports that “there has been a 
growing trend, both within and outside psychoanalytic circles, to doubt 
the existence of sexual latency.” ° One very interesting experimental study 
has since appeared, giving results consonant with an orthodox interpreta- 
tion of latency.“ The developmental assumption is that castration anxiety 
will be high early (before the age of 7) and late (as adolescent develop- 
ment begins in the teen years). Using the method of incomplete fables, 
’s vi int, see A. Freud (19 age: 
p Zz Tor a discussion fom the pees ero tme were reviewed by G. J. Dialycha 
and M. M. Dudycha (1941 J 


63 Blum (1953), pages 133-134. 
64 Friedman (1952). 


analysts have raise 
Blum, in reviewing 
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Friedman found that between the ages of 7 to 12 his subjects more freely 
than younger or older children ended the stories with a strongly suggested 
dismemberment ending, showing reduced castration anxiety during this 
period. A supplementary finding was that latency by this criterion ends 
earlier for girls than for boys, a result that is plausible in view of the 
earlier onset of puberty among girls. Further material having to do with 
the sexual attachments of daughters for their fathers showed that from 
the ages of 7 to 11 the girls differed little from the boys, while from 11 
onwards there was a definite increase in the kind 
as sexual feelings toward the father, A decline 
at age 16. 


While our topic is learning, not personality development, if the genetic 
theory of psychoanalysis contains any truth, learning will show dif- 
ferences from one level to another. Hence developmental stages 
possible significance for learning. 


s of fantasies interpreted 
was possibly setting in 


are of 


Obstacles to Learning 


In case histories from child-guidance clinics we find many illustra- 
tions of obstacles to learning based on the personal history of the learner., 
The teacher as a parent figure may arouse false expectations or reinstate 
continuing battles; some symbols used in teaching may be so freighted 
with personal meanings as to be defended against; conflicts over aut} 
may result in non-reading or in spelling handic 
is clothed more with arbitrary authority than 
spelling.’ 

There is little recognition within 
matters so important for the kind of le 
environment of the home and school 
thaniel Cantor (1946) 


hority 
aps, for no subject matter 
English pronunciation and 


the learning laboratory of these 
arning that takes place in the soci 


al 
. At least one skilled teacher, N; 


a- 


amics of 
es of resistance, ambivalence, pro- 


Psychodynamics of Thinking 


The possibility of drawing upon psychoanalysis in studying the psy- 
chology of thinking has been brought strongly to the fore by the appear- 
ance of a large volume assembling papers on the organization and pa- 


85 Much of the relevant case material is reviewed by Mahler-Schoenberger (1942) 
and by Pearson (1952) (1954). 
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thology of thought. Especially important for the psychoanalytic influences 
are the sections devoted to symbolism (including experimentally pro- 
duced dreams), to motivation of thinking, fantasy thinking, and pathology 
of thinking. 

While, as on most topics, one can find a few pioneer investigations, 
the suggestions from psychoanalysis have not yet been fully incorporated 
into the experimental study of thinking. For example, Seeman (1951) 
lists 23 titles as relevant to the Freudian theory of daydreams, but only 
seven of these use any techniques of quantification at all. 

Learning at its best must include creativity, whether in the form of 
creative imagination embodied in the arts, or in inventiveness in science 
or practical affairs.” The hints from psychoanalysis are deserving of study. 

The study of character or personality syndromes may prove useful 
in determining why of two people, equally intelligent, one is creative, the 
other not.*8 We have to be careful not to give pat answers, based on a 
stereotyped conception of “typical” personalities. For example, the com- 
pulsive dispositions associated with the “anal character” are not creative 
in essence, and may, in fact, produce mere handwashing instead of pro- 
ductive effort, Yet there is a certain plodding quality about scientific data- 
gathering which may demand some of this same compulsiveness. Scien- 
tific objectivity, a valuable trait, is at the same time a form of dissociation 
or isolation between wish and intellect, and is perhaps achieved at some 
cost to the individual. Some papers by Kubie © have stressed the personal 
problems of the scientist. 

In the current emphasis on general-education programs in our col- 
leges and universities we hear much about the integrative role of the arts 
and music, Yet as we look at artists and musicians we see some truth in the 
popular conception that they are somehow different, and occasionally 
classify as deviates. Hence the question recurs of the relationship between 
art and neuroticism. One form of putting it is this: Would the artist be 
less creative if he resolved his conflicts through psychoanalysis? Is not 
o him? Possibly if we were successful in our mental 
hygiene programs we would develop a class of very uninteresting normal 
and contented people; perhaps we are saved from this by the ineffective- 
ness of our measures. There are more constructive possibilities. For one 
thing, artists and writers who come to analysis are those who find that 


his neurosis precious t 


also Rapaport (1950). r 
$ R: E Ait referenc, see Vinacke (1952), pages 238-261, 
98 E.g., Stein and Meer (1954). 


°° Kubie (1953) (1954). 
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they are afraid to practice their arts. For them, psychotherapy may give 
back their creativity. Again, the end of therapy is not necessarily to get 
rid of conflicts, but to find a way of living with them. The issues are too 
important to be resolved casually on the basis of a few case studies, 


Academic psychology ought to take its share of responsibility in the study 
of creativity at all levels. 


Therapy as Learning 


Experimental psychology will always be cut off from some of the 
more important aspects of psychoanalysis if it insists upon methods lying 


ee association, the 


tween magical omnipotence and reality, so that the patient unle 
neurotic habits and acquires realistic and socially 


realized that the handling of tra 
did not hesitate to use the word. 


with his patient, and how the patient reacted. Therefore the need 
sort of objective record is required. We probably need, in 
recorded interview, an immediately recorded statement f. 


associations that remained unexpressed. Where the ps 
in will be in the analysis and interpretation of the d 
vides.” 


Perhaps the main conclusion to be drawn from recognizing the un- 
explored areas in developmental aspects of learning, in psychodynamic 


70 Freud (1920b), page 18. 
^ For a discussion of these possibilities, see Bronner, A. F., and others (1949). The 
most satisfactory studies of psychotherapy at present are the investigations of client- 
centered therapy by Rogers and his associates, e.g., Rogers and Dymond (1954). For a 
treatment of psychotherapy as learning, see Shoben (1949). 
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obstacles to learning, in the study of thought processes and creativity, and 
in the understanding of the learning that goes on within psychotherapy, 
is that these problems can be solved only through co-operative effort. The 
division of labor between the academic psychologist and the psycho- 
analyst is not an absolute one, with the analyst doing everything that 
can be done within psychoanalysis, and the psychologist only that which 
can be done in the laboratory. If we are to achieve the ultimate under- 
standing of the learning process not only in all its richness, but also in 
the form of verified propositions, there must be collaboration as well as 


division of labor. 


EXPERIMENTS ON REPRESSION AS ILLUSTRATIVE 
OF THE LABORATORY USE OF PSYCHODYNAMIC 
CONCEPTS 


As indicated earlier (page 800), psychologists were quick to pick up 
the suggestion from Freud that affective factors might influence recall. 
We shall consider two problems: the relation between affect and ease of 
recall, and the relation between affect and recognition thresholds. Both 
imply that unpleasant affect may hinder recall or recognition through 
events identified with or analogous to Freudian repression. 


Repression and Recall 
A review of the pertinent literature by Zeller (1950a) lists 93 refer- 
ences, enough to suggest how inviting this field has proved to be to in- 
vestigators. We shall consider a few experimental reports typical of four 
approaches to the study of affective factors in recall. o 
1. Recall of associates to affectively-toned sensory stimuli. The most 
satisfactory study, following upon several earlier ones, is that of Ratliff 
(1938). He had subjects rate the pleasantness and unpleasantness of odors, 
Pitches, and colors. Then the subjects associated numbers with the af- 
fective items and later were tested for recall, The results with odors were 
against the hypothesis of the forgetting of the unpleasant, but with pitches 
and colors the recall of numbers associated wih the pleasant items was 
superior to the recall of numbers associated with unpleasant ones. Because 
the pleasantness and unpleasantness of sensory nme has little personal 
relevance, such experiments are no longer considered very pertinent as 
reudi; ry. 
tests - vie es sae i of the relationship to the repression theory, 
it is of some interest to cast up the score on experiments dealing with the 
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recall of pleasant and unpleasant items (including words as well as sensory 
stimuli). Of 51 studies of this kind, 32 (63 per cent) favor more effective 
recall of the pleasant over the unpleasant, 14 (27 per cent) favor the 
reverse, and 5 (10 per cent) have neutral or ambiguous results.7? 

2. Recall of memorized words with personal affective connotations. 
In order to introduce some kind of ego threat into the laboratory-type 
experiment, Sharp (1938) went to the case records of neurotic subjects 
to find words that would be emotionally unacceptable to individual sub- 
jects, and words that for them would express gratifications. She then com- 
pared their retention of unacceptable and acceptable words, and found 
the unacceptable words less well recalled. Unfor' 


ported repetition of portions of her work did not 
sults." 


tunately, the only re- 
yield confirmatory re- 


al 
words from a word-association test given to each subject, One of these 


words was imbedded in a list to be memorized. The recall of the list was 
then disrupted by retroactive inhibition through new learning inter- 
he disruption presumably 
included repression, for the critical word was more difficult to recall than 
used to alleviate the anxiety 
urage its recall much as for- 
s in psychoanalysis, Although 
tried to reproduce the memo- 


to commend it, however, that it is to be hi 
variant of his experiment can be discovered or invented. 

Both the Sharp and the Keet experiments are more relevant to Freud- 
ian theory than the other experiments on affective materials because the 
selected items with a history of personal affective loading in the life of the 


individual outside the laboratory. Then they brought these items into the 
laboratory for quantitative study. 


8. Recall of affectively-toned life experiences. Ina somewhat different 
effort to achieve n 


aturalness in experiments, experimenters have aban- 
doned the memorizing of material under the artificial conditions of the 
laboratory and turned instead to study of the recall of actual experiences 
met outside the laboratory. 


Meltzer (1930) asked 77 college men and 55 college women to de- 


72 The tally was made by Zeller (1950a), page 42. 
7 Heathers and Sears (1943); see Sears (1943). 
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scribe their Christmas vacation experiences. He not only had them list their 
experiences, but also asked them to rate the experiences as pleasant, un- 
pleasant, or indifferent. Six weeks later he unexpectedly asked them to 
repeat the listing they had done on the day of their return. Not only did 
they list more pleasant than unpleasant memories immediately after re- 
turning from vacation, but after six weeks the predominance of pleasant 
over unpleasant memories increased. This finding is in accord with the 
interpretation of repression as an active process, continuing after the 
original event, though other interpretations are possible." 

A social-psychological bent was given to the naturalistic-setting re- 
call experiment by Edwards (1941) (1942) who studied the retention of 
political statements by those differing in their political views. He found 
that his subjects tended to remember better material compatible with 
their political attitudes than material incompatible with them. Hence he 
emphasized the “frame of reference” in which remembering occurs, con- 
flict with prevailing beliefs perhaps being more important than pleasant- 
ness and unpleasantness in controlling selective forgetting. Ý 

4. Repression induced by experienced failure. In a careful theoretical 
treatment of repression phenomena according to possibilities within 
theory, Sears (1936a) suggested a possible paradigm 
for the initiation of repression, and then proceeded to test it experimentally 
(Sears, 1937). The essential feature is that an excitatory tendency moving 
along a stimulus-response chain towards a consummatory response will 
be blocked and fail of consummation if, prior to consummation, a compet- 
ing tendency is also evoked. This competing tendency violates conscience 
or is a threat to self-esteem, and can arouse guilt or self-criticism. Thus the 
arousal of an anxiety response concomitantly with an anticipatory goal 
response may prevent the occurrence of the goal response, This is, in 
short, Sear’s paradigm for repression. His own experiment introduced a 
failure task (card sorting) between two sessions of nonsense syllable 
learning, The interference with the second list following failure was in- 


terpreted by Sears as an indication of repression.” 

Discouragement induced by failure might lead to a lowering of effort 
and hence lead to poorer performance on à later task without any active 
repression. Hence, as Zeller (1950a) has pointed out, a complete test of 
repression ought to show recovery when the associated threat is alle- 


viated, Several experimenters have attempted to demonstrate the two 


and Leeper (1936). 
biien ines failure-induced changes in recall and the 


ed-incompleted: task experiment may be noted. The 
ly been reviewed in Chapter 8. 


stimulus-response 


74 See, for example, Waters 

75 The family resemblance 
kinds of changes in the complet i 
Zeigarnik-type experiments have alrea 
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phases: induced repression followed by the lifting of the repression. One 
of the first of these was done under hypnosis (Huston, Shakow, and 
Erickson, 1934). Suggestions under hypnosis led to the attachment of 
guilt to certain words. In the waking state, while amnesic to the hypnotic 
experience, the subjects showed through word associations an influence 
on associations to the guilty words that might be interpreted as repression. 
Following removal of the suggestions under rehypnosis, the repressive 
effects disappeared. Diven (1937) used a conditioning method, following 
certain word pairs with shock. In the recall of the paired-associates, sub- 
jects did much better with the non-shocked than with the shocked pairs, 
as though recall of the “dangerous” words had been inhibited. After a “de- 
conditioning” series, recall proved superior for the shocked words, hinting 
that an unconscious (repressed) integration may be a strong one. An 
attempted repetition of Diven’s work has not been successful (Cannicott 
and Umberger, 1950). There are apparently subtleties in this kind of ex- 
periment, as suggested also by Sharp’s and Keet’s, that 

municated in the description of the experimental procedures, 

The initiation of repression through induced failure has been studied 
in an extension of Sears’ type of experiment by Zeller (1950b ) (1951), 
Russell (1952), and Aborn (1953). These experiments, with the ex, 
of Russell’s, found not only a disruption of rec 
recovery when threat was alleviated, 

Zeller used nonsense syllable learning and retention as the b 
interpolating success and failure experiences in imit 
cubes, His experimental design included two tests of retention prior to 
the initiation of “repression,” then an immediate and a delayed (48 hour) 
test while “repression” was active, and finally an immediate and delayed 
test after removal of “repression.” His results for both recall and relearning 
corresponded to predictions from Freudian theory, though he pointed 
out that alternative interpretations cannot be excluded, Zeller used four 
conditions, so that he was able to compare the inhibiting results of failure 
specific to a second task with the inhibiting results of failure on a second 
task that had become associated with the performance of a previously 
learned task. Repression was not found when the failure was highly 
specific, but it was found when failure on the irrelevant task b 
sociated with performance on the earlier learned one. 

Aborn (1953) added another dimension b: 
intentional (“set”) learning with those for in 
jects, reading the colored numbers on cards 
and Dvorine tests, thought they were taking 


are not com- 


ception 
all, but some subsequent 


asic task, 
ative tapping of Knox 


ecame as- 


y comparing the results for 
cidental learning, His sub- 
selected from the Ishahara 
a color-blindness test. Those 
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“set” to learn were told to remember the numbers that appeared on the 
cards; the “incidental” learners received no such instructions. The “ego 
threat” was introduced for the experimental subjects by following the 
series of numbered cards with others on which no numbers appeared, thus 
implying that the subject was color weak because he could not distinguish 
a number against the background of colored dots. The threat was en- 
hanced by having him fail on some presumably related perceptual tasks, 
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ssion and recovery in incidental learning associated with ego 
ree he th {pe of numbers learned incidentally while taking a color- 
blindness tests Mean 20 subjects in each group. Replotted from the data of Aborn 
(1958), page 228. 
s taken from the Arthur Performance Scale. As a test of 
rote down the numbers remembered from the color 


d the numbers on a recognition test. The memory 
tests were unexpectedly repeated two days later. At this point the “threat” 
was removed by revealing the true nature of the experiment: The memory 
test was again given unexpectedly two days later and again after three 
days. The results for the incidental learners are given in Figure 44. For 
the control subjects there is progressive decline (“forgetting”) with the 
passage of time. For the experimental subjects, however, there are reduced 
scores after failure, followed by recovery after removal of threat. The 


stencil design 
memory, the subject w 
test cards, then checke 
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higher scores at the end, for the experimental subjects, correspond to 
Diven’s finding, mentioned above. There were no differences of this kind 
for the subjects who had been given a set to learn; hence Aborn concludes 
that there are advantages in using incidental learning for this kind of 
study, and perhaps this fact helps to account for Russell’s (1952) negative 
results. 

Where do these studies lead us? The facts of amnesia, without addi- 
tional experiments, make abundantly clear that repression occurs, and that 
memories once lost can be recovered without relearning,”* While there- 
fore experiments are not needed to establish the phenome 
they may help us to delineate the precise circumstances under. which 


repression occurs. Clarification of repression phenomena is today more 
important than establishing their genuineness, 


na of repression, 


Repression and Recognition Thresholds 


While measures of recognition were used as tests of memory in some 
of the foregoing experiments, a rather different type of recognition experi- 
ment has recently come into prominence. In these experiments recognition 
refers to the perceptual recognition of well-learned material, as in the 
reading of a word under brief exposure. This differs from recognition as 
a test of memory for barely learned items. If the material to be perceived 
has connotations that are unpleasant, anxiety-producing, threatening, it 


may arouse perceptual defense, a warding off that results in higher th 
olds. The tendency not to perceive is, th 


against the aroused meanings." 
The contemporary series of experiments had roots in earlier 
ments but began most strikingly with a report by McGinnies 
He recorded simultaneously recognition thresholds 
responses (GSR’s) to neutral and emotional toned (taboo) words, pre- 
sented by a tachistoscope. During the prerecognition interval (i.e., while 
the exposure time was still so short that the subject could not name the 
word presented) the subject produced larger galvanometer deflections to 
the words he was going to have greater difficulty in recognizing than to 
the words which would prove eventually to have lower thresholds, The 
interpretation was that the GSR revealed anxiety connected with the taboo 
words, and that the high threshold for verbal response represented a per- 


ceptual defense against recognition of the forbidden words, 

76 Some good cases from ‘combat neuroses are given by Grinker and Spiegel (1945a) 
(1945b). 

7 Klein (1954), pages 268-270, quite properly argues against assigni ll i- 
tive controls in experiments of this kind to “defense.” 8 Ening all cogni 


resh- 
en, a form of repression, a defense 
experi- 
(1949), 
and galvanic skin 
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This study was followed almost immediately by criticisms and supple- 
mentary experiments, so that within the next three years a dozen reports 
appeared bearing upon it, and they have continued since.’* 

Some of the criticisms have to do with experimental logic, and some 
have to do with interpretations. Because all of the studies happen to treat 
of closely related substantive material, a review of a few of them will serve 
to indicate the complex causal factors that operate even within well- 
defined and restricted experimental contexts. 

The most telling criticism against McGinnies’ results as a demonstra- 
tion of perceptual defense lies in the difference between his emotionally- 
toned taboo words and his neutral words in frequency of use in ordinary 
speech, as objectively defined by magazine word-counts. The criticism, 
first proposed by Howes and Solomon (1950), has led to a number of ex- 
periments demonstrating conclusively that the more common words have 
lower recognition thresholds than less common words, quite apart from 
their affective significance.” 

Following up this criticism, Postman, Bronson, and Gropper (1952) 
equated critical and neutral words for frequency of ordinary use. They 
not only did not confirm McGinnies’ finding, but obtained the very op- 
posite result: lower thresholds for taboo words. While, therefore, the 
commonness of the words used must be controlled, does the total effect 
always disappear when that control is exercised? Not necessarily, Mc- 
Ginnies and Adornetto (1952) controlled word familiarity in experiments 
with schizophrenics, and found higher thresholds for taboo words, as in 


McGinnies’ original experiment. 
At least three additional determiners, besides frequency of word use, 


have been shown to affect recognition thresholds. 

1. If subjects are given incomplete information about what kinds of 
words to expect, the difference between taboo and neutral words is greater 
than if information is given in advance (Lacy, Lewinger, and Adamson, 
1953). Advance instructions obviously have nothing to do with experienced 
word frequency, though they may produce a readiness to perceive less 


familiar words. 

2. An increase i 
thresholds for related words, whethe 
mon ones (Wispé and Drambarean, 


3 o see Postman (1958). 
78 For a review through Oe oat p between frequency and threshold, see Howes 


79 For further work on the ETERO dL 1952 
1954), McGinnies, Comer, and Lacey ( ), Postman 
poa Solmon ( TOD E a E Howes (1951), Solomon and Postman (1952), 
Wispé and Drambarean ( 1953). 


n need-relevant motivation can influence recognition 
r the words are common or uncom- 
1953). They found the recognition 
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thresholds for need-relevant words reduced after 10 and 24 hours without 
food and water, even though the thresholds both at high and low need 
states were correlated with the commonality of the words. 

3. The nature of the choices confronting the subject can influence 
the results, apart from the familiarity of the words. Thus a forced-choice 
method, in which the subject was required to choose a syllable from a list 
to correspond to the briefly exposed one, produced more difference be- 
tween thresholds for shocked and non-shocked syllables than a free-choice 
arrangement (Lysak, 1954). 

These supplementary findings do not diminish the need to control 
word frequency, but they do suggest that the phenomena studied ex- 
perimentally are functions of many other circumstances, 

Another criticism of the McGinnies type of experiment is that de- 
liberate suppression may take place, producing reduced response but for 
reasons differing dynamically from Freudian repression. There may be a 
social hesitation to say taboo words, even though the words are recognized, 

Another line of investigation is related to McGinnies’ finding of GSR 
responses differentiating between neutral and taboo words at exposure 
times below threshold levels. The finding suggests renewed attack upon 
subthreshold perception, or subception, as it has come to be called, The 
issues are complex, but it usually turns out that there are sufficient supra- 
threshold features to account for most of the phenomena.** Jt is difficult 
even to know whether we are dealing with perception or 
interpretation based on previously perceived fragment 
(Lawrence and Coles, 1954), or whether we are de 
or verbal habits ( Neisser, 1954), 

In the attempt to remove some of the ar 


a retrospective 
ary information 
aling with perception 


tificiality of the taboo-word 


80 This criticism, and closely related ones, have been made by Howes and Solomon 
(1950), Lacy, Lewinger, and Adamson (1953), Lysak (1954), Postman, Bronson, 
and Gropper (1952), Whittaker, Gilchrist, and Fischer (1952) 

* See, for example, Bricker and Chapanis (1953), Lazarus and McCleary (1951 ). 

82? This meets in part Klein’s (1951) contention that we need to know more about 
the individual personalities of the subjects used in these experiments, In his experi- 


ments “high-interference” subjects sometimes react opposite to “low-interference” ones 
(Klein, 1954), 
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correlations obtained were not high, enough of them were significant to 
lend plausibility to the hypothesis that unacceptable needs tend to be 
defended against in perception.** 

The phenomena of repression have not yet been very much illumi- 
nated by experiments on perceptual defense any more than they have by 
experiments on the ease of recall. Patience is needed, however, and it is 
too much to expect that as these topics are newly brought into the 
laboratory we can achieve at once both richness and precision. 


ESTIMATE OF FREUD’S CONTRIBUTION TO LEARNING 


Freud’s Position on the Typical Problems 


The foregoing account shows many points at which Freudian theory 


has impinged upon the psychology of learning. A recapitulation is in 


order to summarize his views as they relate to the several problems with 


which learning theories deal. 


1. Capacity. As a developm ‘ 
that the very young infant is most impressionable,“ so that primary re- 


pressions occur in the first years of childhood, and character syndromes 
conflicts over food, toilet training, sex, aggression. 
flicts takes place according to fundamental 


themes or styles of response, and the rest of life is spent in playing out 
these early themes with the assimilation, of course, of new content, Fun- 
damental changes in the structure of the personality do not take place 
through ordinary education; such changes require the special kind of re- 
education provided by depth therapy. Alongside these modes of conduct 
determined by instinctual conflicts there are the conflict-free ego processes, 
Which are the kinds dealt with in ordinary give-and-take with the real 
ènvironment. At some stages of life the conflictual personal problems loom 
unusually large, and make difficult the adjustment to external reality im- 
plied in school learning. The easiest time for conflict-free learning should 
be during the “latency period,” and after adolescent conflicts are resolved, 
but individual differences are to be expected throughout because some 
children do not outgrow their earlier fixations, and some conflicts remain 
active and disruptive at any 48° mer So ; 
mpulsion is not a principle of learning- 


2. Practice. The repetition-co here , ; 
the compulsive behavior is peculiarly resistant to 
quences. The principle of learning through 


ental psychology, Freud’s theory implies 


find their origins in the 
The resolution of these con 


by-doing, for ordinarily 
change according to its conse 


83 See also Eriksen (1952); Lazarus, Om Ns 
84 This may possibly be related to “imprinting, 


Eriksen, and Fonda (1951), 
” as emphasized by the ethologists, 
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practice is better illustrated by “working through,” that aspect of therapy 
in which the patient faces the same conflicts over and over again in the 
process of re-education. Learning takes place in “working through” be- 
cause the conflicts are faced from new angles, and the cues to faulty con- 
duct become detected early enough so that the behavior can be deflected. 
Repetition is needed for learning, but for repetition to be effective it must 
be repetition with a difference. 

3. Motivation. Freud’s is chiefly a psychology of motivation, and he 
detected motivational control in kinds of behavior that others had thought 
of as trivial or accidental, such as minor forgettings and slips of speech. 
The first serious effort to incorporate psychoanalytic theory into general 
psychology recognized the Freudian wish as providing a theory of motiva- 
tion.® The motivational concepts that have impinged most directly upon 
contemporary learning theory are anxiety (as a learned drive), and the 
consequences of various ego threats, as in the studies of regression, ag- 
gression, repression, and the defense mechanisms generally. The emphasis 
upon sex as a master drive has been less influential in learning theories 
than in other fields of personality study, although many of the ego- 
threatening situations used have included sexual conflicts, 

4. Understanding. Despite Freud’s preoccupation with the irrational 
in human behavior, his theory lays great stress upon the possibilities of 
cognitive control. We can even gain insight into our unconscious processes, 
if we work at it properly—no mean achievement for the intellect, The 
developing “ego psychology,” playing up the “conflict-free ego sphere,” 
allows even more room for rationality in the control of conduct. 

The aim of psychoanalytic therapy is to get rid of self-deception and 
other blocks to rationality. To the extent that the methods are successful, 
they should provide some principles useful for learning and teaching. 

( 5. Transfer. We must not permit verbal equivalences to lead us to 
assume identity where identity does not exist. “Transference” has a special 
meaning in psychoanalysis. It refers to the special role that the therapist 
plays for the patient as the therapist stands from time to time for important 
people in the patient’s life, perhaps the mother, or father, or older brother. 
The patient reacts to the therapist with the emotions appropriate to these 
other people. Thus psychoanalytic transference does share with transfer 
of training the fact of generalization of responses learned in one situation 
to novel but related stimuli/ 

Problems of equivalence loom large in psychoanalysis. By way of 
symbols, something commonly stands for something else and provokes the 


85 Holt (1915). 
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responses (especially affective responses) appropriate to that something 
else. We have seen this process at work in displaced aggression. Other 
processes of symbolization and condensation are relevant to the manner 
in which earlier and later learning come into psychological relationship. 

6. Forgetting. Freud long championed the view that registration of 
early experiences persists throughout life, that forgetting is therefore 
chiefly the result of repression. Forgetting is mostly a kind of amnesia, 


and under appropriate circumstances (as in dreams) the persistence of 


these memories may be detected, while under other circumstances (as 
the repression may be lifted and the memories restored 
Although the most important repressions take 
d, the repressive process continues throughout 
d adding new items to the 


in psychoanalysis ) 
to waking consciousness. 
place first in early childhoo 
life, maintaining the original repressions, an 


unconscious store. 


Significance of Psychodynamics for Learning Theory 
One influence of psychoanalytic thinking has been to broaden the 
topical content studied within the field of learning. Psychoanalytical think- 
ing has helped to erase the boundaries between the neurotic and the 
normal, so that what was once relegated to “abnormal psychology” now 
becomes part of general psychology. Because there is a recognized con- 
tinuum between the neurotic and the normal, the learning of symptoms 
ning of mannerisms and the acquiring of 


becomes continuous with the lear equi 
attitudes, Symbolism in night dreams makes us look for symbolism in day- 


dreams, and in other products of creative imagination and thought. What 
we learn from unusual perceptual distortions, hallucinations, and amnesias 


influences how we think about ordinary perception in relation to the needs 
dinary forgetting as related to motivation. Hence 


the range of topics that interest students of learning has been extended 
to include perceptual distortion, repression, and SRR implying 
personal and idiosyncratic modes of expressing the resu ts o learning. 
In one form or another the conception of unconscious determination 
has made important changes in thinking about human motivation. Psy- 
chologists seeking to maintain the behavioristic orientation of the 1920's 


Were somewhat resistant to consciousness and even more so to an un- 
But even Watson was influenced strongly by 
he unverbalized” as a substitute for the 
-objective controversy did not turn out 
ting motives of which the learner 


of the perceiver, and or 


Conscious consciousness. 
Freudian doctrine and accepted “t 
Unconscious.** Thus the subjective 
to be a decisive one, so far as incorpora 


88 Watson (1924). 
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was unaware (or of which he could not speak). Freud can be credited with 
being the first to propose that repression leads to the inability to verbalize: 


Now, too, we are in a position to state precisely what it is that repression 
denies to the rejected idea in the transference neuroses—namely, the translation 
of the idea into words which are to remain attached to the object.57 

Any appraisal of psychoanalytic influences would be most incomplete 
if it did not acknowledge the emphasis upon unconscious processes and 
their derivatives. This is the most obvious way in which to acknowledge 
the dynamic contribution of psychoanalysis. 

Finally, the genetic or developmental aspects of psychoanalysis have 
brought to the fore the need for an adequate ego psychology.** If we are 
to understand the learner as he sets his goals and works realistically toward 
them, or as he is torn by conflicts that prevent his using his abilities, or 
as he burns himself out in the quest for futile objectives, we need a theory 
of personality organization incorporated within our general theory of 
learning. 

The goads from psychoanalysis to broaden the content studied within 
learning, to recognize the range of motivational determiners, including 
unconscious ones, and to build toward an ego psychology are acceptable 
ones, and the acceptance of these challenges has already proved reward- 
ing. A word of warning is in order. Psychoanalytic thinking is very com- 
plex and loosely formulated, so that it is not easy to find out what is es- 
sential, what is dispensable, and what internal contradictions there are in 
it. We need a careful, critical systematization of what appears to be best 
substantiated through clinical experience and through other sources of 
scientific evidence, so that irrelevancies and contradictions are either dis- 
carded or stated in form for decisive testing. We need to locate the most 
pertinent issues, and then to do our best to get adequ 


ate evidence to re- 
solve these issues, There is much in Freud th 


at nearly everyone will 
reject, such as the inheritance of acquired characteristics, and some meta- 
phorical statements about localization of function in the brain. The prob- 
lem is not to guess what Freud “really” meant and then to defend or refute 
him. The problem is to find out what is true, regardless of who said it. 
Freud uncovered some interesting hypotheses. We need to state them as 
carefully as we can, and then put them to the test. 


87 Freud (1915b), 133-134. Miller (1948b) has quoted these words 
a related theory of repression in relation to displacement. 

s See, for example, Symonds (1951). Misbach (1948), after a careful review of 
Freud’s statements, concludes that his contributions to learning theory lie largely 
in the emphasis upon ego organization as the background for learning. Misbach believes 
the pleasure-pain (law of effect) principles in Freud to be secondary. 


and developed 
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The Need for Collaborative Research 


If psychologists are to avoid working with pallid replicas of the events 
dealt with in psychoanalysis, some of their work will have to be carried 
on in collaboration with those experienced in depth therapy. The psy- 
chologist will have to gain such sophistication as he can through personal 
experience with the phenomena, while he also learns the necessary meth- 
ods and controls required to do research with them. Some of the problems 
will elude him unless he works closely (at least some of the time) with 


those who are dealing intimately with the more extreme manifestations of 


the phenomena—with emotional panic instead of the irritation of mild elec- 


tric shock, with traumatic amnesia instead of the repression of mildly taboo 
words, with transference of hallucinatory intensity instead of a comfort- 
able rapport, with the unusual states created under drugs and hypnosis in 
addition to those created by a few hours of hunger. It may be that some 
research problems in this field can be solved only with types of facilities 


r er 
that are but now coming into being. 
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FUNCTIONALISM 


j a eight learning theories we have examined in the preceding chap- 
ters include one stimulus-response theory that antedated behavior- 
ism (Thorndike), three varieties of stimulus-response behaviorism (Guth- 
rie, Skinner, Hull), one “purposive” or “expectancy” behaviorism (Tol- 
man), two variants of gestalt theory (classical gestalt and Lewin), and a 
psychodynamic theory (Freud). For the final major position to review, we 
return to the one out of which behaviorism grew but which flourished 
alongside it and continues to be well represented in contemporary psy- 
chology. It is the loosely formulated position known as functionalism, ap- 
propriately treated last because of its eclectic character and because its 
supporters have been free to learn from all of the other theories of learn- 
ing. 

While the name “functionalism” arose in connection with the brand 
of psychology developed by Angell and Carr at the University of Chicago, 
the name is equally applicable to the dynamic psychology that Wood- 
worth taught at Columbia. Boring considers Woodworth “perhaps the 
best representative of the broad functionalism that is characteristic of 
American psychology.” * Woodworth thinks of functionalism as a middle- 
of-the-road position, not really a “school” of psychology.? 


GENERAL CHARACTERISTICS OF FUNCTIONALISM 


Functionalism in one form or another has a long history, In Europe 
such men as Galton, Binet, and Claparéde could be classified as function- 
alists. In America, James, Hall, Ladd, Baldwin, Cattell, and Seashore were 
all functionalists in outlook. While we turn to the University of Chicago 
first because it was there that functionalism became defined, functionalism 
early in the century was “in the air” at Clark under Hall, at Columbia 

1 Boring (1950), page 722. 


2 Woodworth (1948), page 255. 
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under Cattell and Woodworth, and at Yale under Ladd, Scripture, and 
Judd. 


The Influence of John Dewey 


John Dewey (1858-1952) is the “founder” of official functionalism. 
His famous paper on the reflex-arc concept in psychology (1896), through 
its clear opposition to prevailing elementarism, marked the beginning of 
the new protest, One might have expected him to direct his attack against 
sensationism, for sensation was the fundamental element in the Wundt- 
Titchener psychology. Instead the attack was on the reflex-are concept, 
which, as Heidbreder has pointed out, was the very concept that had 
been introduced to move beyond a static description of elements of con- 
sciousness. There was, in fact, a controversy going on between Baldwin 
and Titchener over reaction time (involving the reflex arc) and the first 
experimental paper using Dewey’s formulations was a study of reaction 
time by Angell and Moore (1896), mediating the Baldwin-Titchener 
Controversy, t 

Dewey's main argument was that activity should not be thought of as 
starting with a stimulus, going through a central process, then emerging 
in a response. Instead, the activity is a complete cycle—a “reflex circuit” 
—in which the response may seek or “constitute” the stimulus. The rela- 
tion of the response to the posterior stimulus may be as important as its 
relation to the anterior one.° 

Functionalism was christened by Titchener (1898) when he attacked 

ewey’s position, contrasting functionalism with structuralism. The chal- 
lenge was a ccepted by Dewey’s younger colleague, James Rowland Angell 
(1869-1949), who became the leader of the new movement. We shall 


return to him presently. 
Dewey a cid ademit psychology to education before he left 


icago, and he went to Columbia in 1904 never again to concern him- 
self directly with the problems of laboratory psychology. But he had 
already left his mark on the functional psy chology that continued after he 
moved on, The animal laboratories and the studies of child development 
express the spirit of his Darwinism, with its emphasis upon the evolution 
of adaptive mechanisms.’ Not much of his later writing appears in psy- 


a 
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chology textbooks; later references in them are chiefly to his How we 
think (1910), which gave a practical and teachable account of some 
steps in problem-solving modeled after the steps in scientific investiga- 
tions. 

It would be a mistake, however, to think of Dewey’s influence upon 
the psychology of learning as limited to the work of those who developed ` 
functional psychology within the laboratories. At least three other lines 
of influence can be discerned, each bearing his imprint. 

The most marked of these, for its relevance to the psychology of learn- 
ing, is his influence upon ways of thinking about learning as the product 
of schoolroom practices. This influence began with his founding of the 
experimental elementary school at the University of Chicago,’ and the 
subsequent development known in its heyday as progressive education. 
His emphasis upon interest and effort, upon the child’s motivation to 
solve his own problems, represented a dynamic innovation for which the 
laboratory psychologists were not yet ready. Consequently Dewey’s chal- 
lenge was taken up by educational philosophers rather than by psycholo- 
gists.° Now it so happens that cousins may each resemble a common 
grandparent without resembling each other. Although Dewey's influence 
is clear in Carr’s Psychology (1925), and in Kilpatrick’s Foundations of 
method (1925), the detailed learning theories of these two books, the first 
a psychologist’s and the second an educator’s, have little in common. Pro- 
gressive education, at its best, was an embodiment of the ideal of growth 
toward independence and self-control through interaction with an environ- 
ment suited to the child’s developmental level. The emphasis was upon 
intelligent problem-solving (not upon learning through play), in which 
each child solves his problems by selecting appropriate materials and 
methods and by learning to adapt these materials and methods to his 
ends. His interest sustains his effort as he experiments with his solutions 


by testing them in action. The kinds of problems solved are social as well 
as individual, for education is envisa 


democracy through democratic living 
all sounds from the rules for memorizin 
minimum number of trials! 


ged as a preparation for life in a 
here and now. How different this 
g a list of nonsense syllables in the 


The second line of development, outside academic psychology, was 
upon sociological social psychology, by way of George Herbert Mead 
(1863-1931), who came with Dewey from Michigan to Chicago, Mead, 
whose views were very similar to those of Dewey, became the guiding 


7 For the early history of this school, with the oint of view that i i 
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spirit for developments in sociology paralleling those of Dewey in psy- 
chology and education. Mead is important for contemporary learning 
theory because he sponsored a form of self- or ego-psychology a whole 
generation before experimental psychologists were ready to take it up.° 
Mead’s interpretation of the self as arising out of social interaction is 
close to the spirit of Dewey the social philosopher and the defender of 
democratic values, who saw every act as the expression of a unified self 
seeking to resolve its conflicts through intelligent action in a world of 
objects and other selves. 

Finally, we have Dewey the philosopher, carrying on the pragmatic 
tradition of Peirce and James in a form known as instrumentalism or 
experimentalism. Even late in life he continued to write upon logic (1938 ) 
and upon epistemology (with A. F. Bentley, 1949), always deferring to 
the scientific method as the one that has helped man to acquire his firm- 
est knowledge. He insisted that ordinary mortals, without the scientist's 
equipment, could learn lessons from the method of science helpful in 
the conduct of their daily lives. 

A case could be made for bringing these diverging trends into a new 
synthesis that would have significance for the psychology of learning. 
Indeed, in ill-defined ways something of the sort may already be happen- 
ing, as we meet new emphases within learning theory upon ego aspects 
of learning, upon searching behavior and intelligent problem-solving, 
upon learning in social contexts. These new emphases supplement a prior 
Preoccupation with quantification and a somewhat constricted methodol- 
ogy. A new synthesis, strictly in the functionalist tradition, may legiti- 
mately acknowledge Dewey in its ancestry. History has a way of moving 
m cycles, Shortly before Dewey’s death his interpretation of perception 
as a transaction between the organism and the environment (foreshad- 
owed in the original paper on the reflex arc) began to find its way back 
into the literature of psychology.”* Possibly the rediscovery of other use- 
ful ideas may cause a re-evaluation of Dewey s place in the history of 


Psychology, 


Angell and His Successors 
development of functional psy- 
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Carr succeeded him at Chicago; Edward S. Robinson, a younger man, 
taught at Chicago before he went to Yale to carry on a similar program 
until accidental death cut short his career. Next to carry the mantle was 
John A. McGeoch, whose career, too, was ended by death at an early age. 
The students of Carr, Robinson, and McGeoch continue to be active, such 
men as Bills, Heron, Irion, McKinney, Melton, Waters, Underwood. Their 
students, in turn, keep the flavor of Chicago functionalism alive, although 
no one of those mentioned is now at Chicago, and several of them never 
were. 

In an early clear statement of the functionalist position Angell (1907) 
made three points: 

1. Functionalism is interested in the how and why of mental opera- 
tions as well as in the what (i.e., descriptive content). It is a cause-and- 
effect psychology, or in modern terms, an input-output psychology, con- 
cerned with mental operations in their context. 

2. Functionalism is essentially a psychology of the adjustment of 
the organism to its environment. Consciousness evolved to serve some 
biological purpose, to help the organism solve its problems especially when 
conflicts arise and habits no longer suffice. Once this position was taken, 
the way was open to welcome applied psychology: educational psychol- 
ogy, industrial psychology, mental hygiene. 

8. Functionalism is interested in mind-in-body, and so studies the 
physiological substratum of mental events. The implied dualism is a purely 
practical one (as in current psychosomatic medicine) and does not imply 
a special position on the mind-body problem. The only position that must 
be rejected, if one is to stress the adaptive role of consciousness, is that 
known as epiphenomenalism, the view that consciousness is a useless by- 
product of neural activity. 

Now that the quarrels within psychology have shifted somewhat, and 
the definition of psychology’s province is no longer a matter to fight about, 
the word “function” has tended to return more nearly to its mathematical 
use, as in the expression, y =f (x), (“y is a function of x”), This causes no 
great shift for the functionalist, for all he asked was to be free to study 
many sorts of contingencies, to find what depended on what, whether the 
dependency was upon age, species membership, prior practice, the in- 
gestion of a drug, or accepting the experimenter’s instructions. Neverthe- 
less, the shift is away from the earlier emphasis upon adaptiveness, upon 
purposes served. While the functionalist is free to work on applied prob- 
lems, the freedom is one of choice, and a functionalist may work with 
“pure” functions if he chooses. There is a kind of neutrality about the 
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mathematical function that makes it particularly appealing to the scien- 
tist; like the correlation coefficient, it expresses a relationship without 
requiring anyone to claim that it explains the relationship. If it does lead 
to a kind of explanation, there need be nothing teleological about it; there 
is no implication, with Voltaire, that God must have given man a nose 
in order to have a support for his spectacles. If some noses support spec- 
tacles more readily than others, or if some kinds of spectacles fit more 
kinds of noses, that is all that the functionalist is interested in finding 
out. 


The Background of Woodworth’s Functionalism 

Like Dewey, Woodworth was influenced by William James and G. 
Stanley Hall, although he studied only with James. His functionalism 
derived in part from them, and from his early association with Thorndike, 
Cattell, Ladd, and Sherrington. Both Angell and Woodworth became 
interested in imageless thought, and though they disagreed about it, the 
problems of imageless thought raised the question of meaning in rela- 
tion to consciousness, a good functionalist topic.** 

Woodworth’s functionalism gave considerably more place to motiva- 
tion than the earlier Chicago functionalism. The role of motivation was 
Prominent in Woodworth’s Dynamic psychology ( 1918), and his sugges- 
tions were heavily leaned upon when motivation became incorporated 
into Chicago functionalism by Carr (1925). ; 

Apart, then, from their sharing ancestry in men like James and Hall, 


the developments at Chicago and at Columbia were not intimately related 
but were mutually congenial, and with the passing of time the differences 
between them have gradually faded so that we can treat the general out- 
look as one, particularly as that outlook is reflected in contemporary psy- 
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He uses the words from diverse vocabularies, borrowing words freely from 
other traditions. He is not forbidden the use of older words because today 
they sound subjective (e.g., “idea,” “meaning,” “purpose”), or because 
they have occasionally been given systematic connotations that he does 
not accept (“sensation,” “image,” “ego”). He does not believe that any- 
thing is gained by new terms, unless advance in knowledge justifies the 
further precision that new words can bring. For example, he believes it 
premature to call all thinking “implicit verbalization,” for the objective 
terminology is not yet justified by what we know about thinking. Hence 
he holds to the older word. His definition of the field of psychology is also 
a tolerant one, and he is ready to accept information obtained by intro- 
spection, by objective observation, from case studies, from mental tests. 
He is tolerant as to method, and he is also tolerant as to content. The dis- 
tinction between pure science and applied science seems to him trivial, 
so long as either is good science. 

It is a mistake to confuse this broad tolerance with looseness, as 
though functionalism is merely an uncritical eclecticism. On the contrary, 
the functionalist is commonly a very astute critic. Because he has his eyes 
open for variables that may be ignored by more dogmatic systematists, he 
is not easily trapped into accepting “pat” systematic solutions for intricate 
problems. This critical attitude is expressed in a preference for relativism 
over absolutism. Recognizing the many determiners of psychological 
activities, Carr deplored what he called the quest for “constants,” by 
which he meant one-to-one correspondences, or psychological laws stated 
without reference to all the influential variables. 

2. The functionalist prefers continuities over discontinuities or typol- 
ogies. The mathematical statement of a functional relationship usually 
implies a gradual transition between the values of the dependent variable 
correlated with the independent variable as it increases or decreases. 
While extremes may differ so markedly as to appear qualitatively unlike, 
the functionalist looks for connecting in-between cases. This continuity 
of function was made the basis for a dimensional principle by McGeoch, 
and the suggestion has been elaborated by Melton. 

The two chief classes of variation recognized by Melton are those 
dependent upon experimental arrangements, which may be called situ- 
ational dimensions, and those that depend upon the psychological func- 
tions involved, which may be called process dimensions. 


12 Carr (1933). 


14 McGeoch (1936), Melton (1941a) (1950). The dimensional principle was applied 
earlier to a modern version of the attributes of sensation by Boring (1933a), 
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The situational dimensions call attention to the differences between 
what the learner is called upon to do from one experiment to another. A 
very different problem is presented to the learner by a path through a 
maze, the rotating target of a pursuitmeter, a list of syllables on a mem- 
ory drum, a reflex to be attached by conditioning to a new stimulus, and 
a mathematical problem to be solved by reasoning. 

The process dimensions are correlated in a coarse way with the 
Situational dimensions, but are distinguishable. They include such con- 
siderations as the amount of discovery required before the correct re- 
Sponse is made (a distinction between rote learning and problem-solving), 
the involvement of motor or ideational responses (as between motor skills 
and verbal memory), and the relevancy of motivating conditions (inci- 
dental learning as against intentional learning). 

Experiments have to be classified according to more than one dimen- 
Sion. Thus rote learning and reasoning experiments are alike on the di- 
mensions yerbal-nonverbal, but unlike with respect to the amount of 
discovery involved in the adequate response. It is important to recognize 
that the classification is not according to fixed types, but always according 
to scales that have intermediate values. Learning is not blind on the one 
he other; there are degrees of understanding in- 


and and insightful on t > i 
at one extreme to a maximum at the other, with 


V ee 
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Most cases falling between these extremes. 
3. The functionalist is an experimentalist. In its modern form, func- 


tionalism is dedicated to the experimental method. The issues upon which 
the functionalist is so tolerant become a part of his science only when 
they are translated into experimentable form. He is free of constraint in 
iS Choice of dimensions, and may choose to set up a dimension of items 
Sraded for similarity or a dimension of tasks graded according to degree- 
PPunderstanding, He prefers to drive general issues back to specifics 
efore he is led into controversy over them, for he believes that many 
linguistic difficulties fade when reference is made to the specific findings 
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recognized by leading functionalists.?* The preference for environmental- 
ism shows up largely in the treatment of perception, where the issue as- 
sociated with the differences between Helmholtz and Hering has been 
kept alive in differences between gestalt psychologists and associationists. 
The functionalist prefers to interpret perceptual discriminations as 
learned.** Although these biases are not absolute ones, when there is room 
for doubt (as in some aspects of size constancy) one can predict that the 
functionalist will be on the side of empiricism and against nativism. 

In summary, we find the functionalist tolerant but critical, favoring 
continuities over discontinuities, seeking to translate his problems into 
experimentable form. Within the free, eclectic atmosphere, he neverthe- 
less has a preference for interpretations coherent with historical associa- 
tion theory as against holistic or nativistic interpretations. 


FUNCTIONAL ASPECTS OF THE LEARNING 
PROCESS 


Because the functionalist does not have a highly articulated learning 
theory, we can best understand his approach by following a functional- 
ist’s analysis of the problems facing an experimental psychology of learn- 
ing. In what follows we shall accept Melton’s (1950) summary of learning 


as a representative functionalist statement, using his analysis of learning 
and his outline of topics. 


(2) 


(4) 
FIGURE 45. The readjustive process. After Dashiell (1949), page 26. 


Melton begins with Dashiell’s diagram of the readjustive process 
(Figure 45). The diagram shows how a problem arises when ongoing ac- 
tivity (1) is blocked by an obstacle (2), Eventually the organism through 


15 Carr (1931), Robinson (1932a). Melton (1950) describes his position as an 
“gssociationistic functionalism.” 

16 E.g., Carr ( 1935). The favorite experiment quoted by functionalists has long been 
that of visual experiences with reversing lenses, Stratton (1896). 
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varied behavior (3) solves the problem (4) and proceeds on his way (5). 
If the process is repeated, the adequate response (4) recurs in less time, 
with less excess and irrelevant activity. 

Thus the learning process is primarily a matter of the discovery of the 
adequate response to a problem situation and the fixation of the satisfying 
situation-response relationship." 

From this analysis, Melton is led to state several major experimental 
problems of learning. Expressed as topics, these are: 

- Motivation 
. Initial discovery of the adequate response 
. Fixation and elimination 


. Factors determining the rate of learning 
. Transfer of training and retention 


We can follow this outline in characterizing the functionalist ap- 


Proach to these topics. 


UUA 0 N He 


l. Motivation 

Woodworth was the first functionalist to give a treatment of motiva- 
tion in modern form. in a small book entitled Dynamic psychology (1918). 

It was this book that gave currency to the term “drive” as alter- 
native to “instinct.” Woodworth set the stage for the prominence that 
drives and motives were presently to have within psychology. 

The cycle between aroused activity and quiescence is discussed ac- 
cording to preparatory and consummatory reactions, Woodworth crediting 
Sherrington 18 with the distinction. Consummatory reactions are those 
Which satisfy basic drives or needs, activities such as eating or escaping 
from danger, They are directly of value to the organism. Their objective 
Mark is that they terminate a series of acts, leading either to rest or to the 
turning by the organism to something else. A preparatory reaction is only 
indirectly or mediately of value to the organism, its value resting on the 
fact that it leads to or makes possible a consummatory reaction. The ob- 
Jective mark of the preparatory reaction is said to be that it occurs as a 
Preliminary stage leading to the consummatory reaction. 

Preparatory reactions are of two kinds. One kind represents alert 
attention, “a condition of readiness for a yet undetermined stimulus that 
May arouse further response.” Another is the kind of preparatory adjust- 
ment evoked only “when the mechanism for a consummatory reaction 
as been aroused and is in activity.” The latter sort of reaction is goal- 


1, Melton (1950), page 670. 
herrington (1906). 
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directed, and evokes seeking behavior, as when a hunting dog, having lost 
a trail, explores about in order to get back on it. 

A new conception introduced is that of ongoing activity as itself a 
drive—activity once under way leading to its own completion. Even if 
the original drive under which the activity started has ceased to operate, 
the activity, if incomplete, may continue. The distinction between drive 
and mechanism breaks down when it is asserted that the mechanism (i.e., 
the reaction under way) may become its own drive. Allport later (1937) 
proposed that motives may become functionally autonomous, that is, com- 
pletely divorced from their primal sources in basic needs of the organism. 
Activity leading to further activity, and functional autonomy, are very 
similar conceptions. Both are related to the problem of secondary rein- 
forcement. If stimuli associated with reinforcement themselves come to 
have reinforcing power, it may be that a system of derived rewards may 
be built up which will continue to have motivating power without re- 
course to the satisfaction of any basic need. The increasing importance 
attaching to secondary reinforcement in theories like Hull's and Skinner's 
shows Woodworth’s perspicacity in being the first to recognize that the 
distinction between drive and mechanism becomes less sharp as activity 
gets under way. 

Another facet to Woodworth’s motivational theory is the abiding 
interest that he has shown in the concept of “set,” dating from his early 
experiments on imageless thought. It was the new emphasis brought about 
by the imageless thought controversy that in the first decade of this 
century provided functionalism’s supplementation to traditional associa- 
tion psychology. Although Angell and Woodworth were on opposite sides 
of the fence on some issues concerned with the introspective contents of 
thought, they were in agreement on the dynamic consequences of pre- 
disposing sets and intentions.2® 

The sets and intentions emphasized by the students of imageless 
thought, and the preparatory reactions oriented toward consummatory re- 
sponses are concepts with a similar dynamic loading. They are both di- 
rectional, they are both inferred from the manner in which behavior se- 
quences are organized. In a later development of his conception of set, 
Woodworth introduces what he calls the situation-sct and the goal-set 
(1937). The situation-set refers to adjustments to environmental objects; 
the goal-set refers to the inner “steer” which gives unity to a series of varied 
but goal-directed activities. 


19 Woodworth (1906); Angell (1906). 
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The important suggestion in the concept of situation-set is that the 
environment must be brought into the psychological account in a manner 
not suggested by the usual stimulus-response psychology. Guthrie, for 
example, makes much of the fact that he is interested in movements, not 
in outcomes. Woodworth takes the opposite position. Movements are, to 
Woodworth, less important than the information about the environment 
to which these movements lead. The rat in the maze learns spatial re- 
lationships, not running habits; in the problem-box the cat discovers the 
door-opening character of the button, not a sequence of manipulatory 
movements. As Woodworth puts it: “The character of goodness attaches 
for the animal not to his movements in manipulating the button, but to 
the button as an object.” 

It is to Woodworth’s credit that his long history of espousing stimulus- 
Tesponse concepts did not so fix his thinking that he was unable to go 
Outside this systematic framework as he felt it necessary. The conception 
of situation-set presents a kind of bridge to ways of looking at psychology 
that have been carried out more fully by others, notably by Tolman and 
Brunswik, Through the years, Woodworth’s use of “set” and related con- 
cepts has prevented his falling into a static system. 
ivation in his textbook (1925), was in- 
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Scribed as involving a motivating stimulus, a sensory situation, and a 
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is called the incentive, or the immediate objective or goal of the response. 
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phase as a sensory change became Carr's way of talking about the em- 
pirical law of effect.” 

Melton, accepting Carr’s definition of motivation as a stimulating 
condition, points out that motivating conditions have three functions: (1) 
they energize the organism, making it active, (2) they direct the variable 
and persistent activity of the organism, and (3) they emphasize or select 
the activities that are repeated (fixated) and those that are not repeated 
(eliminated). Stated in this way, the functionalist’s interpretation of mo- 
tivation is consistent with Hull’s, but it is coherent also with the position 
of Tolman and Lewin. 


2. Initial Discovery of the Adequate Response 


Melton points out that initial discovery may be the main problem in 
some kinds of learning (e.g., in the problem-box ), while it plays little part 
in others (e.g., in serial rote memorization). The controversy between 
trial and error and insight in the interpretation of learning hinges in part 
on the nature of this initial discovery of the correct solution. The function- 
alist does not have to take sides: he waits for definitive experiments. 

Melton notes that two other forms of initial discovery have played 
less systematic roles in learning theories, yet they should not be neglected. 
One of these is guidance. The discovery of the correct response can be 
facilitated by the teacher or trainer through appropriate manipulation of 
the environment or the learner. Hence guided learning is intermediate 
between rote learning (with no discovery) and problem-solving (with un- 
guided discovery), An extensive research program on guided learning was 
carried out by Carr and his students and later summarized by him.** Such 
a program illustrates the freedom with which the functionalist selects his 
experimental variables, 

The other form of initial discovery mentioned by Melton is imitation. 
He believes it can be subsumed under guidance, however, for it helps the 
learner to discover the solution by observing the solution by another 
organism. 

Melton, in common with other functionalists, is willing to accept 
the initial discovery of solutions with insight, but, in common with other 
associationists, prefers interpretations emphasizing past experience: 

Insight is, however, not a term to conjure with even though it is a useful 
descriptive concept when employed within the framework of the major dimen- 


sions of analysis of the problem situation and the process of discovery of the 
correct response. In any event, the range, specific nature, and plasticity of the 
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trial behavior of the organism seem to be most profitably related to the motivat- 
ing condition of the organism and to the associative spread from previous learn- 


ing.** 
3. Fixation of Adequate Responses and Elimination 
of Inadequate Responses 
Melton accepts the empirical law of effect as stated by McGeoch: 
Other things being equal, acts leading to consequences which satisfy a 
selected and strengthened, while those leading 


motivating condition are 
a motivating condition are elimi- 


to consequences which do not satisfy 
nated.” ** When it comes to explaining effect, Melton is cautious in choos- 
ing between a contiguity theory and a reinforcement theory: 
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Some guiding principles are needed to establish order in an em- 
pirical program of such vast scope as accounting for everything that affects 
all kinds of learning by all kinds of learners under all kinds of circum- 
stances. Some of these guiding principles were inherited from the pre- 
experimental association psychologists. A useful list had been provided by 
Thomas Brown as early as 1820. He had classified the laws of association 
(or laws of suggestion, as he called them) into primary or qualitative 
laws (the laws of similarity, contrast, and contiguity), and secondary or 
quantitative laws (frequency, vividness, emotional congruity, mental set, 
etc.). The primary laws represent the essential general conditions for as- 
sociative formation or associative revival, whereas the secondary laws 
determine which of many possible associates is formed or recalled. 

After Ebbinghaus (1885) had shown that associations could be 
studied experimentally, writers such as Miiller (1911-1917) in Europe 
and Carr (1931) and Robinson (1932a) in America began to rewrite as- 
sociative laws in quantitative form. Carr and Robinson were quite explicit 
that all associative laws could be made quantitative, and they gave lists 
of laws that they thought expressed our twentieth-century knowledge 
about associative learning. 

Here is Robinson’s list of laws: *° 


Law of contiguity 

Law of assimilation 

Law of frequency 

Law of intensity 

Law of duration 

Law of context 

Law of acquaintance 

Law of composition 

Law of individual differences 


While in topic some of Robinson’s laws sound qualitative, he attempts 
to show how all qualitative distinctions can be subjected to a dimensional 
analysis. He himself raised the question whether or not in isolating these 
factors as important he had really stated “laws.” He summarized his dis- 
cussion of “laws” as follows: 


It is my firm conviction that the facility of associative fixation is a function 
of all of the factors enumerated. Probably several specific relationships are 
involved for each named factor and almost certainly there are other factors 
that have not been included in this list. But, if the assumption that these factors 
are important determiners of association be correct, then there are “Jaws” of these 
factors whether our knowledge of them is definite or not.2* 


28 The laws are stated in Robinson (1932a), pages 62-122. 
26 Robinson (1932a), page 124. 
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It turns out that Robinson’s position is much like Skinner's, in that 
he points out the relevant variables and makes a claim for “lawfulness” 
rather than for “laws.” Because of these limitations in the list of “laws,” 
the Carr-Robinson laws did not catch on, even among their students, as 
a means of ordering the factors determining the rate of learning, although 
there are occasional mentions of isolated laws from the list. The issues 
expressed in the laws were somewhat too general. The empirical role of 
contiguity," for example, breaks down into the study of several varieties 
of time relationships between events, with their corresponding gradients, 
Thus the separation of a conditioned and unconditioned stimulus in clas- 
sical conditioning yields one kind of empirical law of contiguity, while 
the delay of reward at the end of a maze yields another kind. We tend 
to name such relationships more concretely (e.g., gradient of reinforce- 
ment, spread of effect) rather than to refer them to a more general law 
of contiguity. So, too, the law of frequency, as a law of relative frequency, 
covers too many topics, including the form of the learning curve, the form 
of the work curve, the effects of distributed and massed practice, the 
consequences of degrees of overlearning. Frequency is only a general 
topic, within which a number of very different laws are discoverable. 

Contemporary functionalist writers, instead of setting up basic laws 
of learning, tend to use classificatory schemes derived directly from ex- 
perimental arrangements and results. Underwood, for example, uses the 
following scheme to outline the factors determining the rate of learning: 


a. Massed versus distributed practice 
b. Type of material S 
(1) Intra-list similarity 
(2) Meaningfulness 
(3) Affectivity 
c. Knowledge of performance i ' 3 
d. Miscellaneous (e.g, whole versus part learning; active recitation; sense 
modality; amount of material) °$ 
While Underwood's treatment is part of a chapter in an introductory 
textbook on experimental psychology and is not intended to be exhaustive, 
his classification of variables bears a close resemblance to that used in 
other functionalist treatises on learning.” Such a classificatory scheme is 


not in itself a theory of learning, yet it suggests as well as the Carr-Robin- 
son laws the kinds of variables with which functionalist systems deal. 


27 istingui from the fundamental axiom of contiguity as in Guthrie’s 
ie Pe a aber of contiguity is a dimensional one, concerned with 


degrees of contiguity. 
28 Underwood (1949), pages 898-419. s , 
AA A iterature handled functionally can be found in McGeoch 
2° Major summaries of the litera! eae cath nal Schlosberg (1954). 


and Irion (1952), Osgood (1953), 
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5. Transfer of Training and Retention 


' Once something has been learned it can be used, provided it has not 
been forgotten and provided new, situations recur in which the previously 
learned behavior is called forth. The study of the relative permanence 
of learning, when tested in situations essentially duplicating those of the 
original learning, is the study of retention, while the study of the effects of 
old learning in new situations is often discussed as transfer of training. 
Because recurring situations always recur with differences, there is ob- 
viously an intimacy between retention and transfer. Because transfer ef- 
fects may be positive, negative, or indeterminate, one view of forgetting 
is that it is but an illustration of negative transfer. 

Melton follows McGeoch (1932a) in accepting two major laws of 
forgetting. The first is the law of context (one from Robinson’s list), which 
asserts that the degree of retention, as measured by performance, is a 
function of the similarity between the original learning situation and the 
retention situation. The second is the law of proactive and retroactive in- 
hibition, which asserts that retention is a function of activities occurring 
prior to and subsequent to the original learning. Proactive and retroactive 
inhibition have been major topics of research for several decades, ever 
since the problem was first opened up by Müller and Pilzecker (1900). 
A number of good summaries of the literature are available. 

The paradigm for retroactive inhibition is A—B—A, where the 
learning of B is interpolated between the learning and retention of A, and 
interferes with the retention of A. The paradigm for proactive inhibition 
is B—A—A, where the learning of B prior to the learning of A interferes 
with the later retention of A. Both retroactive and proactive interference 
with learning are readily demonstrable, and the empirical relationships 
have led to a number of hypotheses. Studying the development of one of 
these hypotheses will help us to understand not only retroactive inhibition 
but the manner in which functionalists construct their theories. 

One set of problems arises over the similarity between the interpolated 
material and the material originally learned. Robinson (1927), arguing 
from some earlier results of his own and of Skaggs (1925), formulated a 
hypothesis later christened by McGeoch as the Skaggs-Robinson hypothe- 
sis. With the usual functionalist preference for stating dimensions, Robin- 
son proposed relating the amount of retroactive inhibition to the dimen- 


80 For the earlier literature, see Britt (1935) and Swenson (1941 


A ). A general sum- 
mary can be found in McGeoch and Irion (1952), 404-447, 
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sion of degree of similarity between the original and the interpolated 
material or activity. 

With the similarity dimension in mind, Robinson argued that the 
interpolation of identical material (Material B the same as Material A) 
would simply provide additional practice on Material A, hence lead to 
increased retention on the test trials during which retroactive inhibition 
is usually shown. Because retroactive inhibition with dissimilar materials 
was already an established fact, the natural conjecture on the assumption 
of continuous variation is that starting with identity the amount of retro- 
active inhibition would increase gradually as dissimilarity was increased. 
Now, asks Robinson, what is likely to happen at the other end of the 
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DEGREE OF SIMILARITY BETWEEN INTERPOLATED 
ACTIVITY AND ORIGINAL MEMORIZATION 


FIG 2 46. Similarity 1 factor in retroactive inhibition, The curve is intended 
to Renate cae 7 hibition bears a quantitative rea ge to the degree of 
similarity between the interpolated activity and the material originally memorized. 
With maximum similarity, the interpolated activity provides positive transfer, hence 
ll. Maximum interference with recall is predicted to 


increases the efficiency of reca faximui c al 
all at sanie tevetaaed ie value of similarity. After Robinson (1927), page 299. 


Scale, as the original material (Material A) and the interpolated material 
(Material B) become extremely unlike? Presumably retroactive inhibition 
represents some sort of interference based on similarity between the 
original and interpolated activity. If there is very little similarity there 
should be very little retroactive inhibition. Putting all these considerations 
together, it is reasonable to expect a maximum of otaackve inhibition at 
some intermediate point of similarity between Materials A ei B. Robin- 
son formulated the whole generalization in words as follows: As similarity 
between interpolation and original memorization is reduced from near 
identity, retention falls away to a minimum and then rises again, but 
with decreasing similarity it never reaches the level obtaining with maxi- 
mum similarity.” He expressed this graphically by the figure reproduced 


as Figure 46. 
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His own experimental test of the generalization was very simple. By 
the memory span method he studied the recall of the first four of a series 
of eight consonants as this recall was interfered with by the last four of 
the consonants. That is, the first four were considered to be Material A, 
the last four Material B, and the similarity and dissimilarity of Materials 
A and B were controlled. Similarity was defined as partial identity. Maxi- 
mum similarity meant that the second four consonants were like the 
first four; maximum dissimilarity meant that all the last four differed 
from the first four. 

The results confirmed the hypothesis only partially. Starting with 
near-identity, retroactive inhibition increased as the interpolated material 
became increasingly dissimilar. This was part of the conjecture. But the 
decrease in the amount of retroaction (increase in recall) with maximum 
unlikeness was not found. In fact, with the materials totally dissimilar, 
retroactive inhibition was at a maximum. 

A number of later experimenters attacked the problem after Robin- 
son formulated it in this way.** Results gave partial confirmation, but 
there was some incompleteness and minor contradictions appeared in the 
data. The results led Osgood (1949) to propose a more complex formula- 
tion of the relationships involved. Osgood’s solution of the similarity 


problem is diagrammed in the three-dimensional surface shown 


as Figure 
47. 


What Osgood’s surface states is that the amount of transfer in positive 
or negative directions is a function of shifts in similarity between learn- 
ing and retention of both the stimulus conditions and the response re- 

` quired. Shifts in stimulus similarity are from front to rear as noted on the 
right-hand margin, moving from identical stimuli (Sr) through similar 
stimuli (Ss) to neutral stimuli (Sy). Shifts in response similarity are repre- 
sented from left to right, as noted along the back margin, with identical 
responses (Rr) being at the left, and moving progressively through similar 
responses (Rs), neutral responses (Ry), partially opposite responses (Ro) 
to directly antagonistic responses (Ra). 

The best way to practice reading the diagram is to read its edges first. 
The rear edge says that stimuli bearing no resemblance to those used in 
original learning lead to new responses without any transfer effect, positive 
or negative, regardless of the degree of resemblance between the required 
responses and responses that have been used in earlier experiments. The 
front edge says that with identical stimuli there will be maximum positive 


» 31 Mention should be made also of a more 


rational analysis of the form of the 
Skagg-Robinson function by Boring (1941). 
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transfer with identical responses (for this is merely overlearning ), whereas 
with directly antagonistic responses there will be maximum interference, 
for the earlier responses will have to be completely unlearned or over- 
come. The curve moves with two inflections between these extremes, but 
these inflections are not actually determinate from present data. The left 
edge says that, with identical responses, shifts in stimulus similarity from 


+ 


SSi 


> ood’s transfer and retroaction surface (after Osgood, 1949). 
Wn te ee E oont of transfer (+) or interference (—), with neutral zone 
represented by a plane (0). Left to right, amount of shift in response similarity 
etween original task and new task, from identity (Rr) to antagonism (Ra). Front to 
rear, amount of shift in stimulus similarity between original and new task, from 


identify (S1) to neutrality (Ss). 


identity to neutrality will result in decreasing transfer, but no interference 
in new learning. The right edge says that, for antagonistic responses, shifts 
in stimulus similarity from identity to neutrality will produce decreasing 
interference, but no positive transfer. The diagram is a surface, and yields 
a curve wherever it is cut by a vertical plane. 
In Robinson’s kind of experiment the response called for varies with 
the stimulus, so if the stimulus changes the response also changes; (lhis 
is typically true in serial rote learning, because the response item is also 
a stimulus item for what follows.) Hence the Skaggs-Hobinson diagram 
(Figure 46) can be approximated by making a diagonal slice across 
Osgood’s surface (Figure 47), from the near left-hand corner to the rear 
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point marked Ry. The near left-hand coner represents the Skaggs- 
Robinson condition of maximum task similarity (identity of both stimuli 
and responses between learning and testing), while the rear point repre- 
sents neutrality of both stimuli and response relative to the original task. 
Such a diagonal curve will be high at the left, dip down into a zone of 
interference, and return to the neutral point at the rear, thus having the 
essential characteristics of the Skaggs-Robinson curve, though with some- 
what different inflections. 

We may trace the evolution of the models to account for the role of 
similarity. First we have the experiments that demonstrate the existence 
of retroactive inhibition, and then some that demonstrate the possible role 
of similarity as a factor. Next we have Robinson’s somewhat crude dimen- 
sional hypothesis, leading to a series of experiments. Meanwhile a series 
of related experiments on transfer effects are being conducted, as though 
the two phenomena have little in common, Presently Osgood offers his new 
synthesis, covering the data that have accumulated since Robinson's 
hypothesis was announced, incorporating at once the retroactive inhibition 
and the transfer results. The Skaggs-Robinson hypothesis is seen to be a 
limited special case of a more complex set of relationships. This succession, 
with an interplay between data and theory, and a higher synthesis in 
dimensional terms, is exactly what might be expected in a maturing 
functional analysis. The next step, as Osgood points out, ought to be a 
quantitative (mathematically precise) model of his surface. While the 
surface now accounts in a general way for all of the available data, it is 
largely schematic, and nothing is known about the precise form it should 
take, or if, indeed, there is one form or a family of forms. 

The discussion of Melton’s topics has led us somewhat afield from the 
content or conclusions of his own treatment of learning, but the outline 


served its purpose in providing the topics for an overview of the functional- 
ist analysis of the problems of learning, 


Does this emerge as a theory of learning? Melton emphasizes in his 


summary that a dimensional analysis of learning, although scarcely qualify- 
ing asa theory, is not anti-theoretical. Theoretical systems, he states, help 
to clarify problems and they lead to the discovery of new empiric: 
lationships. But these relationships have to be firmly established if we 
are to have a science of learning. We need to standardize and calibrate 
valid and reliable methods of investigation in order to have better d 
for the theorist to manipulate. 


al re- 
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EFFECTS OF THE DISTRIBUTION OF PRACTICE AS 
ILLUSTRATIVE OF CURRENT FUNCTIONALISM 


One of the best ways to study a scientific theory is to see it in action, 
that is, to note how those who hold to the theory select their topics and 
design their experiments and how they analyze and interpret their data. 
We have followed this procedure in reviewing the theories thus far dis- 
cussed. We have had some introduction to the functionalist at work on 
the problem of retroactive inhibition. Now we are ready to examine 
further how functionalism works through giving more detailed considera- 


tion to some selected experimental problems. 


How the General Orientation Contributes to the 

Planning of Experiments 

When we attempt to determine how the theory directs the experi- 
ments that the functionalist chooses to perform we note two characteristics 
of his approach. 

We discover, first, that the general theory provides only an orienta- 
tion to experimentation, without providing a framework of crucial laws 
and postulates. The dimensional approach is a very broad frame of ref- 
erence within which to work. It would be entirely unmanageable and 
cumbersome were it not for the guiding threads provided through years 
ation by familiar methods upon familiar topics. Consider 


of experiment é i 
in studying all the dimensions of massed and 


what would be involved ; 
distributed practice. There are all the variables common to learning ex- 


materials, interrelationships within the 
ask), and there are the special variables 
specific to the problem of distribution (location of the interval between 
items, between trials, between banks of trials, between learning and re- 
tention, within relearning; length of the interval, whether uniform or 
variable; nature of the activities filling the interval). The combinations 
and permutations are staggering. , i 

The second thing that we find out is that the experimenter typically 
narrows his task (within the broad dimensional orientation). He does 
this in two chief ways. First, he falls back on tasks that have familiarly 
been used in these studies, chiefly rote memorization and motor learning. 
He further restricts himself by addressing his attention n some of the 
“special theories” that are current or that he may himself invent. Hence 
the functionalist depends not only upon his general orientation but upon 


periments (subjects, nature of 
materials, the kind of learning t 
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a body of experimental lore and a number of special theories that reduce 
his task to manageable proportions. 


The Testing of Special Theories Related to 
Distributed Practice 


In order to illustrate the foregoing interpretation of how the func- 
tionalist goes about his experiments, we shall examine a number of ex- 
periments studying distributed practice. One series of twelve experimental 
reports, by Underwood and his associates, well illustrates the nature of a 
systematic functionalist investigation.*? While these investigators appear 
to be making a strictly dimensional analysis, the papers are in fact guided 
largely by several special theories, and the dimensions chosen are by no 
means all that might have been selected. We turn now to some of these 
special theories—some tested by Underwood, and others with which he 
was not concerned, 

1. The theory of differential forgetting (McGeoch).** McGeoch’s 
theory states that in the course of memorization or practice a subject learns 
erroneous (conflicting) responses that retard progress. Because these 
errors are less well learned than the correct responses, lapse of time will 
permit the errors to be forgotten more rapidly than the correct responses, 
Hence distributed practice will be advantageous over massed practice, 
In serial memorization some of these errors are remote associations (place- 
skipping tendencies). Because only the correct responses are repeatedly 
reinforced, the correct responses are probably better fixed than the errors, 
Experimenters who have looked for the forgetting of errors, as implied 
in McGeoch’s theory, have been unable to find them. In one of the most 
careful direct studies of the problem, Wilson (1949) found no fewer errors 
in distributed than in massed practice, even though distributed practice 
resulted in more rapid learning, and he failed to find predicted effects in 
retention tests. Later studies, eg, Underwood and Goad (1951) have 
also failed to substantiate McGeoch’s theory. 

2. The generalization-differentiation theory (Gibson).** Gibson as- 
sumes that in the learning of a nonsense syllable the subject has to dif- 
ferentiate the individual syllables from nonsense-syllables-in-general. If 
the subject is making some correct responses and some erroneous ones, we 
may say, according to this interpretation, that the correct responses have 


32 These are Underwood and Goad (1951), Underwood (1951la) (1951b), Under- 
wood and Viterna (1951), Oseas and Underwood (1952), Underwood (1952a) 
(1952b) (1953a) (1953b) (1953c) (1953d) (1954a). 

38 McGeoch (1942), page 142f.; McGeoch and Irion 


(1952), pages 183-188. 
34 Eleanor J. Gibson (1940) (1942). 
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been better differentiated from syllables-in-general, while the erroneous 
responses still suffer from too great generalization, that is, lack of discrim- 
ination, What will be the effect of a rest interval? Because differentiation 
(by analogy with what goes on in conditioning experiments) involves an 
inhibition of alternative responses, rest should bring “spontaneous re- 
covery” of generalization. Hence the differences in the strengths of tend- 
ency to give correct and incorrect responses should be reduced after rest. 
Thus distributed practice should prove to be disadvantageous as compared 
with massed—an implication contrary to McGeoch’s hypothesis. 

Experimental results are more favorable to Gibson’s theory than to 
McGeoch’s. With distributed practice, erroneous responses commonly 
do show an increase over those in massed practice, especially during the 
early trials,’ 

How difficult it is to make predictions that are genuinely unam- 
biguous! We assumed, in stating Gibson’s theory, that distributed practice 
should be disadvantageous. This was not really a sound assumption. It has 
been found that with distributed practice there are more errors, as her 
theory predicts, but learning may still be more rapid under conditions of 
distributed practice. It is possible that more errors are associated with a 
-heightened set to respond that is in turn associated with more rapid learn- 
ing. Part of the mystery of the finding disappears if we remember that 
in addition to false responses there are other kinds of inadequate re- 


sponses in a serial learning experiment, such as a failure to respond, not 


counted as an erroneous response. l 
Another aspect of Gibson’s generalization hypothesis has to do with 


the intra-list similarity of items. If items in a list are very much like each 
other it will be harder to differentiate them. If the process of learning is 
one of differentiation, it should be harder to learn a list of similar (homo- 
geneous) items than a list of heterogeneous ones. This implication of the 
theory can easily be confirmed experimentally.** A second implication is 
that forgetting will be more rapid with greater intra-list similarity because 
of the spontaneous recovery of interfering generalizations. The interfer- 
ing generalizations will, of course, be more conmo with more similar 
items. This implication does not receive experimental support. Because of 
the support of Gibson’s theory in the learning and relearning data, but 
not in the retention data, Underwood (1954b) was led to propose some 


supplements to her theory. 

35 Underwood and Goad (1951), onae oere) (1952b). But not all experi- 
E i .g., Underwood (195la). 
ee There is enba ae ie here between Gibson’s theory and Kéhler’s. See 


Page 246. 
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The results of four of Underwood's studies ** (with both serial- 
syllable and paired-syllable lists) agreed substantially on the following 
three points: (1) The higher the intra-list similarity the more difficult the 
learning; (2) retention over a 24-hour interval is independent of the level 
of intra-list similarity; and (3) the higher the intra-list similarity the more 
difficult the relearning. 

Underwood is called upon to account for the unexpected failure of 
intra-list similarity to have an effect upon retention, although it clearly 
affects both learning and relearning. He proposes that there may be two 
kinds of interference that do not reveal themselves in retention because 
they balance each other: 

(1) If intra-list similarity is high, there is interference within the 
list, as predicted from Gibson’s generalization hypothesis. Underwood's 
subjects learned and relearned three successive lists. His evidence shows 
a distinct trend for errors at recall to come from within the list when 
similarity of items within the list is high, and from outside the list when 
similarity of items within the list is low. Because some of these outside items 
came from previously learned lists, it is implied that there is inter-list inter- 
ference when intra-list similarity is low. 

(2) If intra-list similarity is high, the list being learned is more 
clearly differentiated from previously learned lists than if similarity is 
low. This accounts for less inter-list interference with high intra-list simi- 
larity. 

Combining these two propositions we note that there is an inverse re- 
lationship between intra-list and inter-list interference with variations 
in intra-list similarity. Thus when intra-list similarity is high, the high 
intra-list interference is offset by low inter-list interference,** Provided 
these opposing tendencies balance each other, we have an explanation for 
the finding that recall is independent of the degree of intra-list similarity. 

Evidence supporting these conjectures is given in F. igure 48. The curves 
are based on the learning of successive lists of paired associates. Altho 
early in practice the high-similarity lists are much harder to learn than 
the low-similarity lists, the curves converge with practice, High-similarity 
lists become relatively easier both because there is some learning-how-to- 
learn and because there is little inter-list interference. This accords with 
the hypothesis. Low-similarity lists become harder to learn, not because 
learning-how-to-learn is absent, but because there is increasing inter- 


ugh 


31 The four studies are Underwood (1952b) (1953a) (1953b) (1953c). 


88 That better bounded material interferes less than loosel bounded i 
shown earlier by Werner (1947). á PRO anata S756 
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ference (proactive inhibition) from the learning of prior undifferentiated 
lists. ( This, too, agrees with the hypothesis. ) 

The interpretation is that, as one of these interferences goes up the 
other goes down. Hence they may balance so that retention does not show 
any differential effects for high or low intra-list similarity. If these are 
balancing functions, it ought to be possible to give one a boost over the 
other. In one of Underwoodď’s experiments in this series,” it was found 
that with extreme similarity learning remained difficult, but retention was 
improved. This result, too, conforms to his hypothesis. 


INTRA-LIST SIMILARITY 


MEAN TRIALS TO RELEARN 


STAGE OF PRACTICE 


paired-syllable lists as a function of stage of prac- 


F. D arning of 
_ FIGURE 48, Relearning Forwood (1954b), page 163. 


tice and intra-list similarity. Un 

8. The theory of recovery from work decrement. A very plausible 
theory of the advantages of distributed practice has been proposed from 
time to time by those who noted that massed practice had some of the 
characteristics of a “work curve” as well aa learning curve.” This is 
particularly evident in motor learning. In Figure sia for example, we have 
the performance curves of two groups on a pursuit rotor, one: group of 
Subjects practicing for one minute ae resting for three minutes (dis- 
tributed practice), the other practicing for three minutes and resting 
for one minute (massed practice). Scores were recorded at the end of 
each minute of the 3-minute group, and these appear in the curve as a 
series of “work curves.” The gain from practice shows as recovery over 
the 1-minute rests, but there is little advantage in the three times as much 


°° Underwood (1958c). 
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practice that this group has had. The kind of theory that these curves 
suggests, supplemented by “warm-up” phenomena, has been given a 
more formal treatment by Ammons (1947), and works reasonably well 
for pursuit learning.*° 

The related theory for rote memorization was elaborated by Hull and 
others (1940), largely around the experimental work of Ward and Hov- 
land.“ Here it was assumed that interferences accumulating within learn- 
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FIGURE 49. Distributed practice in pursuit learning. Group A practiced 1 mi 
and rested 3 minutes, while Group B practiced 3 minutes alk mel 1 chan 
Scores at each minute within the 3-minute trials show progressive work decrement. 
Doré and Hilgard (1987). è 


ing are of the nature of inhibitions in conditioning, and thus recover with 
time. A number of conjectures were borne out in the data: that there 
should be reminiscence after a rest following learning, that the disad- 
vantage for items near the middle of a list should be overcome somewhat 
with rest, that paired associate learning should show less benefit from rest 
than serial learning. 

The main objection to the theory was raised by one of its proponents 
—Hovland (1940)—when he pointed out that the theory should predict 
that massed practice ought to lead to better retention than distributed 

40 Further work along closely related lines has been done by a number of others, C2 


Kimble and Horenstein (1948), and Irion (1949), 
41 Ward (1937), Hovland (1938) (1939). 
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practice. The argument is straightforward: In order to meet a criterion of 
mastery under inhibitory conditions, more has to be acquired than under 
less inhibitory conditions. Hence the amount retained, when the inhibi- 
tion vanishes, should be greater for the group that learned under a handi- 
cap, that is, the massed-practice group. 

Two remarks need to be made about Hovland’s objection to the 
theory. In the first place, the advantages of distributed practice for reten- 
tion are by no means universal. In this respect some of the data gathered 
by others support Hovland’s theory better than his own.*? In motor learn- 
ing it is not uncommon for the groups with scores depressed by massed 
practice to gain more following both short and overnight rests than groups 
with distributed practice.** In the second place, no single-factor theory 
can be expected to account for the complex data within all phases of the 
distributed practice problem. A theory cannot be completely validated by 
data that agree with it, nor can it be completely refuted by data that dis- 
agree with it. This follows if we admit that there may be competing 
processes operating, and in any experiment one of these processes may gain 
ascendancy over the other. We saw this possibility in Underwood’s sup- 
plementation of Gibson’s theory. 

Theories of the work-interference type, whether expressed in terms 
of conditioned inhibition, or reactive inhibition, or in other language, are 
bound to have some place in the explanation of distributed practice. They 
account well for the similar effects found for such unlike tasks as nonsense 
syllable memorizing, pursuit learning, and concept formation. Direct 
tests have heretofore been made largely with the motor tasks, but Hovland 


and Kurtz (1951) have come up with an experiment in which the results 


of work are tested directly within a memorization experiment. If a subject 


does difficult mental work of another kind just before memorizing a list, 
more reminiscence for the list follows than if he either did not do the 


mental work or was permitted to rest between the mental work task and 


the memorization. PER ai $ 
4. The consolidation theory. One of the earliest “special” theories 


bearing on distributed practice is that of Müller and Pilzecker (1900), 
known originally as the perseveration theory. The theory assumes“that 
the benefits of practice continue for a while after practice ceases, and the 
results of learning are therefore not at their maximum until an appreciable 
time following the end of practice. Were the perseveration theory to hold, 


“2 E.g., Underwood (1952b). f 
Hi gard ond Seatth (1942), Cook and Hilgard (1949). 
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such phenomena as reminiscence and the advantages of distributed pra- 
tice would be expected. Related forms of the theory attribute the consoli- 
dation effect to growth or maturation stimulated by practice.** 

One hypothesis derived from the theory of stimulation-induced matu- 
ration is that, within the wide limits of amount of practice, learning should 
proceed at a rate that is a function of the learner and of the task to be 
learned rather than a function of the number of trials or of the actual 
amount of time spent in practice. Too little practice (or, at the extreme, 
none at all) will fail to stimulate the growth underlying the skill, while 
too much practice will prove to be disruptive and inefficient. 

The hypothesis has been criticized because it is said that its predic- 
tions may be made “on purely empirical grounds.” ** That is, if we know 
that under some circumstances rest can produce as much gain as practice, 
there must be an intersection point at which two practice curves will 
overlap when plotted according to elapsed time, even though one curve 
is based on fewer trials than the other. While this is true, the fact that 
whole families of curves coincide, even within very wide ranges of dif- 
ference in the spacing of practice and rest, is not so easily predicted. The 
two curves from Doré and Hilgard already presented (Figure 49) show 
how the recovery from the detrimental effects of 3-minute practice periods 
brings the scores to about the level reached when practice and rest are 
very differently spaced. The corresponding scores for the beginning and 
end of each of four days of practice under three widely different con- 
ditions of spacing are shown in Figure 50.4 Whatever one’s theory, it is 
pertinent to note that the gains within each day and the overnight losses 
are remarkably similar despite the accumulated differences in the number 
of practice trials. The ordinary trial-by-trial plot would have shown strik- 
ing advantage for distributed practice without calling attention to a uni- 
formity that no other theory predicts. 

The validation or refutation of a theory is no simple matter. Although 
the results presented conform to the consolidation theory, other results 
from the same laboratory show no such correspondence to theoretical 
predictions.“ When some predictions from a theory succeed and others 
fail, the failures mean that the theory, if not wrong, is at least incomplete. 

Other suggestions. It is characteristic of the functionalist to invent 


44 Snoddy (1935), Wheeler and Perkins (1932). 

45 McGeoch and Irion (1952), page 183. 

46 Similar curves for mirror-vision co-ordination as studied by Snoddy (1945) were 
replotted in the first edition of this book, Hilgard (1948), page 356. 


47 Bell (1942), Wright and Taylor (1949). The latter experiment was done with 
verbal learning. 
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ad hoc theories to explain particular findings. This tendency in itself is 
neither good nor bad, although at some stages of knowledge it produces a 
great conglomeration of unrelated theories, and inconsistencies are hard 
to detect. 

We may conclude this treatment of special theories by pointing out 
three suggestions not covered in the foregoing list of special theories. 
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The first of these suggestions is that of Cain and Willey (1939), who 
studied the retention of nonsense syllables following learning to a com- 
Mon criterion of mastery by two groups. The massed-practice group 
learned the list in a single day. The distributed-practice group learned 
Over three days, each day’s practice being interrupted when the subject 
reached a preselected criterion short of mastery, but | higher 
each day. By the third day the distributed-practice su jects ter hed the 
same criterion of mastery that the massed-practice subjects zerched in 
one day. The group that learned by distributed practice turned out to 
be far superior in retention to the massed-practice group, as tested at 
intervals of 1 day, 3 days and 7 days following learning. Partly because the 
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distributed-practice group showed no advantage over the massed-practice 
group in trials required to learn, Cain and Willey argued from the 
distributed-practice group’s superior retention that this group must have 
learned something the massed-practice group had not learned. This was 
to pick up the task on the following day where it had been left off many 
hours before, even when it had been poorly learned. In other words, the 
distributed-practice group had two days of practice at recalling after 
24 hours that the massed-practice group did not have. While this is a 
plausible suggestion within the arrangements of this experiment, it is 
hardly suitable as a general interpretation of the advantages of distributed 
practice. It gives a warning, however, that qualitative changes occur 
when very long intervals are dealt with. It is always necessary to supple- 
ment mere scores by a psychological analysis of the activities going on. 

A second suggestion was originally made and tested by Ericksen 
(1942). He proposed that massing of trials should lead to variability of 
response, through factors similar to the refractory phase that act as 
barriers to repetition. If this is the case, massing should be advantageous 
when the task is one calling for variability of attack. He found some evi- 
dence for this theory for human subjects solving a problem-box. Earlier 
work by Cook (1934) had found massed practice to be an advantage in 
early trials of puzzle-solving, a result coherent with Ericksen’s hypothesis. 
A later test by Riley (1952) failed to find benefit from massing in a task 
with many alternatives; distributed practice proved beneficial through- 
out. 

A third suggestion comes from Underwood (1954a), calling attention 
to individual differences as a factor in the effects of distributed practice. 
He found that retention by slow learners tended to suffer from distributed 
practice as shown in Figure 51. While he believes these results to account 
for some of the discrepant findings on the relation between distribution of 
practice and retention, the results do not suffice to account for all the 
discrepancies. The differences between the two kinds of learners are to be 
taken seriously, but the failure to find advantages with distributed practice 
even for the fast learners shows that something else is operative in these 
experiments.** . 

These suggestions (and they do not exhaust the possibilities) show 
how complex a dimensional approach would become if it were to take 


48 One possibility is that the symbol cancellation task used to fill the interval has an 
interfering effect upon retention, and that this interference is more severe with the 
slow learners. The experiments that find more advantage for distribution have used 
chiefly color naming, shown to be somewhat facilitating as against symbol cancellation. 
( indervood, 1952a). 
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seriously the task of exploring in systematic variation all of the factors 
that have been shown by one or another experiment to be influential in 
experiments on the distribution of practice. 


Concluding Comments on the Distribution of Practice 


An examination of experiments and theories related to distributed 
practice has helped us to see some of the characteristics of functionalism 


at work, First, the general position provides an orientation towards dimen- 
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Sional experimentation. If intervals are important, then we explore an 
orderly series of intervals; if similarity is important, then we find a way 
of grading similarity and we explore the effects of several degrees of 
Similarity, Second, however, the attack is always much more specific than 
the general orientation alone would suggest. Not all or just any dimensions 
are explored, for such an approach would be too cumbersome; instead, 
the methods and materials used tend to be restricted largely to familiar 
Ones, and the particular dimensions chosen for investigation are usually 
related to one or more of the special theories that are current. The pro- 
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gram is very much that of science in general. It a very similar to that 
proposed by Skinner, to the extent that it is dimensional and seeks lawful- 
ness rather than laws; it is related to Hull in that the tests are commonly 
tests of hypotheses. It lies between these two positions in that the ex- 
perimenters who are functionalists usually have more respect for hy- 
potheses than Skinner has, and they are more cautious than Hull about 
general systematic formulations at this stage of knowledge. 

Does the approach pay off in firm scientific knowledge? The answer, 
at least if we generalize from studies of distributed practice, is by no 
means unequivocal, We are left at present with a welter of special theories, 
mostly tested with undecisive results. What is the matter? 

One embarrassment is the failure of later experimenters to agree in 
their findings with the results of earlier experimenters. If we cannot find 
reproducible experimental data, it is futile to try to reach agreement on 
theoretical predictions. Possibly these failures to repeat are due to slip- 
shod experimental procedures, but this is unlikely, Most experimenters 
are careful in what they do and report their results accurately. A more 
plausible diagnosis of failure to come up with identical findings when 
experiments are repeated can be made along the following lines: 

1. Experiments are seldom repeated without “small” or “trifling” 
changes. Strictly speaking, we are not interested in repeating earlier ex- 
periments precisely, for then we get no new information. Hence we in- 
troduce what we believe to be non-essential changes (such as a different 
task in rest intervals), hoping to reproduce the essential findings. If the 
same findings are indeed the outcome, no problem arises. But if we do not 
find the same results, then we do not know whether or not we would have 
found them had we not made the “trifling” changes, Because of the many 
difficulties in so-called repetition, it is essential that we be patient enough 
to repeat exactly what other experimenters have done before we begin to 
make “small” changes.*° 

2. Experimental designs are permitted to become too complex, with- 
out prior psychological analysis of the processes under investigation. Ex- 
perimental design is not solely a statistical problem. While of course re- 
sults must be secured in a manner to make possible adequate statistical 
analysis, the subject's task must also be reckoned with. The counterbal- 


49 Those not close to the experimental literature have no id 
repeatability of experimental results has become. A notable example is the Ward- 
Hovland reminiscence phenomenon, which led Buxton (1948) to ascribe to it the char- 
acter of “now-you-see-it-now-you-don’t.” Underwood (1953e) found so little evidence 
for it in his review of later studies that he designated it as possibly “a prewar phe- 
nomenon.” 


ea how serious this non- 
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ancing of practice effects by using the same subjects in various cycles 
of an experiment is likely to prove not to be an economical procedure. 
Underwood’s results as presented in both Figures 48 and 51 show how 
futile it is to try to equate effects at different stages of practice when 
subjects of different ability are used and when different kinds of tasks 
(e.g., paired associates versus serial learning) show very different effects 
early and late in practice. The simpler design, well represented in the 
experiment of Wilson (1949), in which each subject is used in but a 
single condition, gives all the advantages of applicability of modern 
analysis of variance statistics while permitting much better control of the 
psychological processes involved. It has the further advantage that new 
controls can be run without going through an elaborate counterbalanced 
series in which one cannot fathom what confusion has been created for 
the subject. 

These two suggestions are related. It is too costly to repeat exactly 
very involved counterbalanced designs. If simpler designs were used, it 
would be possible to repeat the particular cycle in which the next ex- 
perimenter was interested, in order to show that under the original con- 
ditions the original results could be obtained. Then the experimenter is 


free to make any changes he wants, with some assurance that his changes 


bear upon the modified results that he obtains. 
A second source of embarrassment is that even the special theories 


appear to have a long life and seldom are disposed of by way of experi- 
mental evidence. The only special theory related to distributed practice 
that now seems ready to be dismissed is that of differential forgetting, for 
no experimenter has found the error items that he ought to find if the 
theory is true. The consolidation theory is not usually treated as very 
promising, yet no theory has been proposed to predict concretely the re- 
sults that happen to fit it. The theories based on warm-up and recovery 
from work decrement satisfactorily explain some, but not all, of the re- 
sults. So, too, Gibson’s generalization-differentiation theory remains in 
the running with some substantial agreements between data and predic- 
ti ilures also. 

K ae ss functionalist’s credit that he can tolerate this 
mass of material so loosely integrated, rather than blinding himself to 
some of it in order to make his theories sound more plausible. Yet it 
would be even more to his credit if he could invent a theary that would 
give a coherent account of the whole array of relationships. We may note 
a dilemma here. Special theories can be stated with great precision, and 
So they have that advantage over general theories, at least in early stages 
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of experimentally established fact. But special theories can also be ap- 
plicable more than one at a time. Hence it is almost impossible to find 
a crucial experiment to test a special theory. If the experiment agrees 
with the theory, that is fine, but we all know that you cannot prove 
theories through occasional agreements, but can only keep them in the 
running that way; if the experiment disagrees with the theory, the theory 
may still be true, but something else may be in conflict with it. There is 
no point in giving up any theory with which occasional agreement can 
be found until there is some more general theory that accounts for all 
the agreements. The hypothetico-deductive method is by no means as 
simple as at first blush it appears to be! 


ESTIMATE OF FUNCTIONALISM 


Current functionalism is not a unified theory such as those earlier 
discussed. What bounds of unity there are lie in a tolerant acceptance 
of a wide range of psychological phenomena and a conviction that the 
task before psychology is to subject the many variables to quantitative 
study. Such a methodological unity permits wide diversity in content. 


The Functionalist Position on the Typical Problems 
of Learning 


Within such an unstructured system there is no one clear answer 
to most problems of learning. The answer to be expected is that “it all 
depends on conditions.” In practice the situation is not quite as free as 
this implies, for gradually there accumulate more commonly accepted 
generalizations. 

1. Capacity. Robinson recognized individual and species differences 
in his laws of individual differences and of composition. It is in line with 
functional developments for McGeoch and Irion to include in their book 
a chapter on learning as a function of individual differences. They believe 
that the increase of learning ability with age is best accounted for on the 
basis of two hypotheses: first, organic maturation, second, changing psy- 
chological conditions (transfer, motivation, personality traits), 

2. Practice. The law of frequency was to Robinson a law of relative 
frequency, which therefore recognizes the losses in score when practice 
is overcrowded, along with the gains when trials are more appropriately 
spaced. There is a tendency to emphasize the form of the learning curve 
and to seek the conditions under which one form rather than another is 
to be found. There is, however, no diatribe against a law of exercise, 
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3. Motivation. Woodworth’s dynamic psychology places motivation 
at its core. Carr accepts in principle the preparatory-consummatory se- 
quence, assigning motivation the role of a continuing stimulus to be termi- 
nated by the goal-response. The concept of “set” enters into the more con- 
ventional experiments on memorization and skill as a motivational 
supplement to the more familiar laws of association. It was the preoc- 
cupation of the early functionalist with such tasks as the learning of rote 
verbal series which tended to place motivation in the background rather 
than the foreground of theories such as Robinson’s. 

4. Understanding. While the associationist recognizes that meaning- 
ful material is more readily learned than nonsense material, degree of 
meaning is but one of the dimensions upon which materials can be 
scaled. Hence he does not believe that problem-solving or insight require 
interpretations beyond ordinary associative learning, The organism uses 
what it has learned as appropriately as it can in a new situation. If the 
problem cannot be solved by analogy, the behavior has to be varied 
until the initial solution occurs. Insight is perhaps an extreme case of 
transfer of training.” : 

5. Transfer. Following Thorndike, transfer falls chiefly under the 
law of assimilation. That is, transfer depends upon degree of likeness 
between the new situation and the old. Woodworth reinterprets the 
theory of identical elements to mean only that transfer is always of con- 
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variables. Its laws are quantitative, directly descriptive of data. There 
is a healthy respect for data, and there is a commendable urge to state 
issues specifically in a form subject to test. The relativism brings with it 
a freedom from bigotry. Before his untimely death, Robinson had turned 
to social problems, with the conviction that the same methods would 
work there. So long as people were forced to think in terms of specifics 
he believed that they could often reach agreement, even though on 
larger issues they were swayed by prejudices and preconceptions. 

The disadvantage of an extreme empiricism and relativism lies in its 
lack of articulating principles to cut across empirical laws. What results 
is a collection of many “laws,” without hierarchial structure. There is no 
economical multidimensional apparatus for fitting together the various 
two-dimensional functional relationships, each of which is necessarily 
cast in the form: “other things being equal.” If the dimensional program 
of the functionalists were fulfilled we should have a large handbook of 
data, with each of the several laws illustrated by a number of graphs 
showing the variations of associative strength under specific conditions. 
Empirical multidimensionality would be achieved through experimental 
designs testing several variables at once, but unless some simplifying 
steps were taken, the possible combinations and permutations of con- 
ditions would mount fabulously. Dimensional analysis puts data in 
order for exposition and for verification, but in itself does not connect 
the data into an economical scientific system. Such a system has to be 
logically structured as well as empirically sound. 

Functionalism is in some respects an eclectic position. It is eclectic 
on the problem of introspection versus behavioral description, by ac- 
cepting both in the account of psychological activity. A favorite illustra- 
tion is that less attention to details is required after a skill is mastered, 
Knowledge about the representation of details in awareness requires an 
introspective report, while the mastery of the skill may be studied by 
observation of the overt movements. Functionalism is eclectic on the 
problem of blind versus intelligent learning, accepting a continuum be- 
tween these extremes, according to the dimensional analysis of Mc- 
Geoch and Melton. Any point of view which is pragmatic and pluralistic 
can easily incorporate concepts from alien systems; hence functionalism 
is well suited to play the mediating role. Functionalism is, however, an 
eclecticism with a bias—the bias of associationism in favor of analytic 
units, historical causation, environmentalism. The bias is more evident in 
Chicago functionalism than in that of Woodworth, who has shown him- 


54 Sec, for example, Robinson ( 1935). 
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self friendlier to less associationistic concepts. New data may be ac- 
cepted but forced back into older concepts not fully appropriate. Thus 
some of the novelty of the observations made in insight experiments is 
lost when insight is treated as merely another illustration of the familiar 
transfer of training experiment. Similarly, what goes on in the reason- 


ing process is not fully encompassed by treating reasoning as trial-and- 
arther out on the dimension of explicit- 


error learning merely a little f 
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implicit response. Those who dis 
object to the functionalist’s incorporating of new experimental findings 
without accepting the theoretical implications of these findings. This 
is a problem with which the eclectic always has to wrestle: how much 
can be taken over without incorporating its systematic context? The 
functionalist has not been greatly concerned about the inner consistency 
of his borrowings because he is less concerned than others about the 
inner consistency of his own concepts. What consistency there has been 
is provided by the framework of associationism within which the func- 
tionalist works. 
Perhaps Melton’s position is the best to take on the matter of fune- 
tionalism’s systematic position: the functionalist does not have an articu- 
lated system, but he would like to have one, and is not anti-theoretical. 
His breadth and self-criticism prevent his embracing a comprehensive 
system until facts are better ordered. ; r 
Whatever the systematic limitations may be, it must be recognized 
that great energy for experimental study has been released within the 


functionalist group. The books of McGeoch and Irion (1952), Osgood 


(1953), Woodworth and Schlosberg (1954) amply attest to this. Out of 
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theoretical backgrounds.** To the functionalist, operationism is a very 
natural development from James’ pragmatism and Dewey’s instrumental- 
ism. 

2. Mathematical models that are “neutral” with respect to many 
historical controversies are essentially “functional” models, in the mathe- 
matical sense of function. This mathematical sense of function is coherent 
with the psychological meaning of function. Hence it is not surprising to 
find Skinner (1953a) writing about the need for a functional analysis 
(meaning essentially a dimensional analysis), or to find Brunswik (1955) 
describing his system as a “probabilistic functionalism.” The mathe- 
matical models to be described in the next chapter can properly be de- 
scribed as functional ones. 

3. Miniature systems, in which restricted realms of data are sum- 
marized according to a few special theories, are also congenial to the 
functionalist. There is a heuristic pluralism involved in functionalism, 
not unrelated to James’ pluralism. That is, in order to move forward in 
the interpretation of data it is sometimes necessary to accept provisional 


(heuristic) interpretations that permit you to move ahead with the analy- ` 


sis. A case in point is VTE (“vicarious trial and error”) as proposed by 
Muenzinger and followed up by Tolman. This rather limited segment 
of behavior (vacillation at the point of choice) is given a name and 
its conditions studied. The many systematic questions (Is it vicarious? Is 
it trial and error?) can be postponed until some of the functional rela- 
tionships are explored. At this stage there is a pluralism, because there 
exist ad hoc concepts not related by definition or equations to the funda- 
mentals of an established system. 

Contemporary psychology, to the extent that it is increasingly op- 
erational, and increasingly concerned with mathematical models and 
miniature systems, reflects the general outlook that, in one form or an- 
other, has all along characterized the functionalist. 
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Chapter 11 


THE EMERGENCE OF 
MATHEMATICAL MODELS 


CIENTIFIC description depends upon accurate observation. To make 
S observations more precise, a scientist commonly uses recording instru- 
ments and then makes measurements with the aid of them, He records his 
measurements in numerical form, whether by counting, or ranking, or 
scaling. As he treats his data in order to discover lawful relationships 
among them he naturally turns to mathematics as the language designed 
to state relationships among the kinds of events to which numbers can be 
assigned. This is all so familiar, especially in the physical sciences, that 
mathematical expression for scientific relationships is taken for granted. 

The use of mathematics in the biological and social sciences is neither 
so widespread nor so successful up to the present time as it has been in 
the physical sciences. Hence we cannot be sure that all biologists or all 
social scientists would today assent to the proposition that ultimately their 
sciences must state their laws in mathematical form. We need not seek 
agreement on this proposition, however, in order to study such progress 
as there has been in the use of mathematical models in these fields, 

Learning data are well adapted to mathematical treatment. Relation- 
ships expressed in the form of learning curves or retention curves can 
also be expressed by equations. Because the quantitative effects of a num- 
ber of variables are known, further equations can be written. We have 
already become acquainted with one system in which mathematical ex- 
pression has been used throughout (that of Hull, in Chapter 5), and an- 
other in which an adaptation of mathematical ideas takes a central place 
(that of Lewin, in Chapter 8). We are ready, therefore, to review the 
place of mathematics in the study of learning, and to note particularly 


some recent developments. 
368 
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EMPIRICAL AND RATIONAL CURVE-FITTING 


When Ebbinghaus (1885) found that his curve of retention was 
roughly logarithmic in form, he fitted a logarithmic equation to it: 


100k 
(log t)° + k (1) 
where b = per cent retained by the savings measure 


t = elapsed time 
c, k = arbitrary constants obtained by curve-fitting 


Thus the very first quantitative study of associative learning intro- 
duced mathematical formulation. 

Ebbinghaus used the procedure that has since come to be known as 
empirical curve-fitting. When we follow this procedure, we first plot the 
experimental data and then look for some kind of mathematical function 
likely to fit it. This function represents a family of curves, say of hyperbolas, 
or parabolas, a growth function, or what not. Then we go through the 
added step of determining the constants (corresponding to the c and k of 
Ebbinghaus’ formula) in order to find that particular member of the 
curve family that comes nearest to fitting the obtained data. The com- 
monest method for doing this is known as the method of least squares. 
What this means is that we seek that member of the curve family that 
does as well as possible in passing near our experimentally determined 
points, the measure of success being the sum of the squares of the devia- 

We try to make this sum a minimum, 


tions of the curve from our data. 
quares.” If we can find another curve family whose 


best fitting member produces a smaller sum of squared deviations, this 


second curve is a better fit, and is generally to be preferred. 
The main point in empirical curve-fitting is that we select the curve 


family solely on the basis of fit, and not on the basis of any theory, Ebbing- 
haus set out with no assumptions and required no theory that forgetting 
ought to follow a logarithmic curve; such a function merely proved to be 
a convenient summary expression of the facts ha discovered. Life insur- 
ance companies predict the likelihood of survival on the basis of such 
empirical functions; they call them actuarial curves because they are 
based upon actual recorded facts. There need be no theory whatever as 
to why older people are more likely to die than younger ones. l 
When rational curve-fitting is contrasted with empirical curve-fitting, 
the word “rational” implies that the family of curves is chosen according 
to some theory or theoretical deduction so that it is logical to expect the 
curve as plotted to be of a given form. Rational curves, because they arise 


b= 


hence the name “least s 
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from and test theories, are more elegant than empirical ones. A rational 
curve of slightly poorer fit may turn out to be more useful for purposes of 
science than a better-fitting empirical curve. The step of curve-fitting 
has to be taken with rational curves as well as with empirical ones. That 
is, while the form of the curve is determined by theory, the particular 
member of the curve family that fits best is determined by the method 
of least squares, exactly as in empirical curve-fitting. Thus the constants 
(or parameters, as they are called) of the rational equation are deter- 
mined empirically. The rational theory demands either that these con- 
stants should be interchangeable from one experiment to another or that 
they should vary in some predictable manner. Hence the theory upon 
which the rational equation is based can be tested by the agreements or 
disagreements among the constants as determined empirically in differ- 
ent experimental settings. These constants, furthermore, have reference 
to variables with additional (non-mathematical) meanings. 


Empirical Learning Curves 


Ebbinghaus’ lead was followed some twenty years later, in 1907 and 
1908, by attempts to state the learning curve in mathematical form, 
These early starts were followed from time to time by others as more 
men became interested in the problem.” 

As an example of the empirical approach, let us consider some sug- 
gestions first made by Culler in 1928, then elaborated later by Culler 
and Girden (1951). Arguing that the complete learning curve must be 
S-shaped, that is, a curve of increasing gains at first, later becoming one 
of decreasing gains as the upper limit is approached, Culler and Girden 
compare the fit of the normal ogive with that of the equation for mono- 
molecular autocatalytic functions. The normal ogive, is of course, the 
familiar integral of the normal probability curve. The mono-molecular 
autocatalytic function yields a very similar curve. When written ap- 
propriate to learning, its equation is: 


beaz 
c+ eär (2) 


where y = a measure of learning 
x = a measure of practice 
A = a constant for the learner and the task 
b = a limit of attainment 
c = a constant of integration 
e = the base for natural logarithms 


1Schiikarew (1907), Robertson (1908). See also Robertson (1923), 
2 For an able review to that date, see Gulliksen (1934). 
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This function,’ as in the case of other empirical curves, does provide 
some hints as to what the learning process may be like. The particular 
mathematical expression implies that the gain per trial is proportional to 
the product of the amount already learned and the amount remaining to 
be learned before the limit of learning is reached. By proper choice of 
constants, this will yield a curve of increasing gains at first, later be- 
coming a curve of decreasing gains. Population growth can be expressed 
by such a curve; * the conjecture is inviting that some self-limiting growth 
process may underlie learning. As matters stand, however, the curve is 
not selected because of any formal theory that learning is such a process, 


The curve is chosen merely because it fits. 
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FIGURE 52. The fit of an ogive to © Culler and Girden (1951), page 334. 


Culler from the learning of three-place symbols. 
Culler and Girden test both equations for a great many learning data, 
and find that both fit quite well. Some Hlustratiye curves are shown in 
Figure 52, These are fitted with an ogive; the authors conclude that 
empirical considerations alone do not permit a choice to be made be- 
tween the two curve forms that they tested. Culler and Girden are well 
aware that empirical curve-fitting is not enough, e a the me of 
earning function must ultimately be made upon i o k heory. 
They point out how futile it would be to obin serten tting curves 
through merely adding more parameters to an equation. l 
* The mono-molecular autocatalytic reaction ae proposed ein ce te for 


leami ‘ The stai 
( 19340 by Robertson (1908). Th 


* Pearl and Reed (1920). 38 
* Culler and Girden (1951), page 328. 
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When additional parameters are used, empirical results can, of 
course, be very closely approximated. The formula proposed for general 
use by Woodrow (1942), for example, contained five parametric con- 
stants: 

y =a + VP +k — (1 — f+) (8) 

His demonstration that he could fit it to a wide range of data did 
not make it survive as a general psychological law at all; it turned out to 
fit relationships of so many forms that it lost its theoretical interest. More 
than empirical curve-fitting is required if mathematics is to serve psycho- 
logical theory. 

While the deductive structure of Hulls theory is rational in form, 
the basic curve of learning that he used did not rest upon any of the 
fundamental postulates about the way in which learning takes place. 
Hence his curve belongs with the empirical curves. The form Hull used 
can be stated as follows: 


sHr =M(1 — e-**) (4) 
where sHpg = habit strength 
M = upper limit of habit strength 
t = trials 
k = a constant expressing learning rate 
e = the base of natural logarithms 
Such a curve (as in the case of the autocatalytic one) can be given 
a meaning that suggests some sort of theory. Hull’s formula implies, for 
example, that the gain per trial decreases with practice throughout learn- 
ing, and that the gain per trial is proportional to the amount remaining 
to be learned before the limit of learning is reached. But Hull did not 
state why this should be so. 


Rational Learning Curves 


An advance in the application of mathematical thinking to learning 
was made by Thurstone (1930a)(1930b) when he based his equations 
on a theory as to how learning takes place. The psychological assump- 
tions are not very complex, and he did not pay any attention to the mecha- 
nisms underlying learning. What happens, according to Thurstone, is that 
the animal (or other learner) performs a number of acts per unit of 
time. His attainment can be stated as the probability that one of these 
acts will be successful. Thurstone introduced law-of-effect assumptions 
(without calling them that), for he proposed that successful performances 
led to further successes with practice, and unsuccessful performances 
led to a dropping out of errors. 
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If the learner at any one time has available to him s successful acts 
and e erroneous ones, his probability of success is simply the proportion 
of successful acts: 


p= S 
s+e (5) 


Similarly, the probability of failure will be: 


a a r (6) 


If learning takes place with practice, we may write a differential 
equation to describe the increase in probability of success with time, in- 
troducing the constant k to describe the learning rate: 


ds k ks 


“a Pste (7) 


Error elimination can be expressed similarly, the negative sign in- 
dicating that errors are reduced with practice: 
de —ke 
= —kq = — 
dt s+e (8) 
Another learning constant is required, reflecting the difficulty or 


complexity of the task. ‘Thurstone shows by computations based on equa- 
tions (7) and (8) that the following equation must hold: 


m=sXe (9) 
That is, the product of errors and successes available at one time 
will be a constant (m). When the task is complex, this product will be 
large; when the task is simple, this product will be small. 
Without any further psychological assumptions, Thurstone is able 
to perform an integration yielding the following curve of learning: 


ee! ae kt 7 
vp- vm (10) 
eccess per unit of time 
tainment) 
learning ability 


task complexity 
hich probability is measured 


where p = probability of su 
(the measure of at 
k = constant reflecting 


m = constant reflecting 
the units over W. 
egration 


By strict mathematical transformations, Thurstone (1930b) was able 
to make some predictions from his equation, in addition to fitting ordinary 
learning curves, He predicted, for example, the relationship between 


t = time in 
z = constant of int 
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length of task and learning time under varied circumstances and found a 
satisfactory relationship between his prediction and existing data. 
Gulliksen (1934) generalized Thurstone’s theory another step and 
arrived at another rational equation that proved quite satisfactory in 
fitting learning data (Figure 53). The constants in his final equation in- 
cluded the initial strengths of correct and incorrect responses, a con- 
stant for the strength added by repeating and rewarding a correct 
response, and a constant for the strength subtracted by repeating and 
punishing an incorrect response. Later Gulliksen and Wolfle (1938) ex- 


Rat No.13 
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000 Experimental values 
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FIGURE 53. Success of Gulliksen’s equation in fitting data from individual 
rats trained in visual discrimination by Lashley’s jumping technique. Gulliksen 
(1934), page 420. 


tended the equation to cover discrimination learning and subjected it to 
experimental test. The papers of Thurstone, Gulliksen, and Gulliksen and 
Wolfle represent a kind of succession, in which each equation is some- 
what more general than the preceding one, and the earlier equations can 
be considered special cases of the later ones. 

This is about where the matter was left before World War II, 
There were other developments in the mathematics of learning, particu- 
larly by Rashevsky and his students and colleagues, but psychologists had 
not shown themselves ready to pay much heed to them.ë The painstaking 
volume by Hull and his collaborators entitled “Mathematico-deductive 
theory of rote learning” appeared in 1940, calling strongly to the attention 
of the psychological world the possibility of a forward step in rigorous 
theorizing, including not only derivations in ordinary mathematics but in 


® Rashevsky had dealt mathematically with learning problems in American psycho- 
logical journals as early as 1931, and his 1938 volume on Mathematical biophysics 
contained chapters on conditioned reflexes, the discrimination of relations, error 
elimination, rational learning and thinking 
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symbolic logic as well. The stage was well set for new mathematical ap- 
proaches after the disruptions of the war years, based, to be sure, in part 
upon developments that wartime demands brought about. 


SOME MODELS FROM OTHER FIELDS POTENTIALLY 
USEFUL FOR LEARNING THEORY 


In the last few years it has become customary to refer to some kinds 
of formal scientific theories as corresponding to one or another “model.” 
The notion of making a model to represent a theory is a very old one. 
Most of us have seen colored balls arranged on wires to represent the 
atoms in a molecule or crystal, though we know very well that the 
molecule under a microscope would not look like that. Yet the model 
does represent certain relationships. Or in describing the operation of 
the Organ of Corti and the basilar membrane we may have learned of 
a hydraulic model. Helmholtz’ resonance theory used a harp as a model. 
The model represents a series of relationships—mathematical, physical, 
conceptual—which appear to be appropriate for the understanding of 
some realm of data. There must be a kind of “fit” between the data and 
the model. The adequacy of the model can be judged by its success in 
, and in making verifiable predictions from the data. 

The more formally a model is worked out, the more likely it is to be 
a mathematical one. A mathematical model consists of a set of relation- 
ships and transformations that can be made consistently according to the 
definitions and rules of the particular kind of mathematics. Data of many 


kinds and from many fields of science may be treated according to the 
of mathematics. Thus the same rules of differentia- 


apply within calculus whether one is investigating 
the action of an electronic circuit, an airplane wing under stress, the 
transmission of a nervous impulse or the behavior of a crowd. 

While a mathematical model can therefore be used with a variety of 
ee arge F d to be developed in close conjunction with the solution 
of particular kinds of problems, and they show some of the history of 
their origins in the form they take. Whether or not these models will 

EA i ay depend upon some analogies 
prove useful within learning theory may n es dal d 
between the substantive problems in the field in which the model de- 
vel i field of learning. Le 
se er model from other fields are worth mentioning here: 
the Petter. eet the information-theory model, and the theory-of- 


games model. 


ordering data 


rules of any one kind 
tion and integration 
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The Feedback Model 


The science of cybernetics was christened by Wiener (1948), who 
encouraged the application of some principles coming from communi- 
cation engineering to physiological and social problems. The word 
“cybernetics” comes from a Greek word meaning steersman or governor, 
and cybernetics can be thought of as the study of control processes in 
machines, in organisms, in social groups. It is a mechanism of control 
that provides the model for these various activities. 

One of the central concepts of cybernetics, “feedback,” has gained 
wide currency in psychological discussion. Let us examine the analogy 
between adaptive behavior and a thermostatic control system. When the 
heat comes on and warms the room the thermostat records the rise in 
temperature. When the temperature reaches a critical point, the thermo- 
stat sends a “message” back to the furnace and turns off the heat. The 
furnace is thus controlled by a “feedback” from the consequences of the 
furnace’s operation. A man walking down a path has a “feedback” from 
his muscles, tendons, and joints, so that he makes corrective movements 
if the ground is uneven or if his foot strikes an obstacle, The awkward 
gait in tabes dorsalis arises because part of this feedback is missing, and 
the placing of the feet has to be controlled visually—thus using a less 
satisfactory feedback device. 

Is feedback a possible model for learning theory? Consider trial-and- 
error learning, especially in the form of approximation-and-correction, 
What the learner does on successive trials is regulated by the conse- 
quences of his activity. The old problem of the circularity of the law of 
effect can perhaps be resolved in terms of feedback, 


Another name for the feedback mechanism is servomechanism, and 
a large literature is now ava 


form.’ 

The feedback model as such has thus far been little used in le 
theory. Ellson (1949) 
tions of operational 


ilable, much of it expressed in mathematical 


arning 
showed how some of the input-output formula- 


analysis might prove useful in the psychology of 
learning, but he failed to combine the feedback concept with the input- 


output one. Leavitt and Mueller (1951) show how the old problem of 
knowledge of results can be treated in the language of “feedback.” The 
instructor tries to communicate to a student a geometrical form, using 
only words. Success is much greater if the student is free to ask questions 
and to receive answers in the form of “feedback.” This seems fairly ob- 


7 For example, MacColl (1945); Truxal (1955). 
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vious, yet there are some interesting possibilities in quantifying the 
amount of feedback and its consequences for learning. 


The Information-Theory Model 

Both the feedback model and the information-theory model have 
their origins in the study of communication. Yet information-theory has 
had a somewhat distinctive formulation, with a clear influence upon 
psychology.’ 

The communication model suggests that psychological events can 
be understood through analogy with the events that occur when a mes- 
sage is transmitted through an electronic transmission system (e.g, a 
long-distance teletype system). The communication paradigm developed 


by Shannon * is reproduced in Figure 54. 


Source of Communication sodar 
— channel of message 


FIGURE 54. Model for a communication system. After Shannon. Reproduced 


from Grant, 1954, page 63. 


: ; apa entes d p ‘ 
Information in the form of a message enters at the source and is 


converted to a code suitable for the given transmission system, A tele- 
type machine converts the typist's strokes into patterns of dots and dashes 
suitable for sending as electrical impulses. These go through the com- 
munication channel as electrical impulses, or perhaps as radio waves 
or light waves. They have to be decoded at the receiving end, and ap- 
pear as letters typed by the teletype receiving machine at the other end. 
Noise (static, etc.) may enter the message in the Process of transmis- 
Sion, and the message may be somewhat garbled berare itisityped mi iie 
destination. In military practice a special coding process may be intro- 
duced, so that the ordinary decoding done by the teletype machine will 
not suffice, and the original pea a be yielded only if the code is 
+ is brok a cryptoanalyst). 
iit- cr bs as oayehalogical events similar to the events in 


such ‘a communication system? One way of picturing them is shown in 
is Shannon and Weaver (1949). Useful 


tl 
vee BY Miller (1953) and Grant (1954), 


8 > r ation: 
standard work on information- 
he sta t 


introductions for psychologists are the pi 
© See Shannon and Weaver (1949). 
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Figure 55. The organism here corresponds to the communication chan- 
nel, transmitting information to the psychologist by way of its behavior. 
The observable responses make their appearance only as complex re- 
sultants of prior conditions as these are “encoded” (perceived? reacted 
to?) by the organism. The responses may be modified by non-essential 
circumstances (here “biosocial noise”), so that the observing psycholo- 
gist has a difficult task in interpreting (“decoding”) the subject’s be- 
havior. This is but one way of using the communication model to de- 
scribe psychological events, and other analogies are permissible. For 
example, the conversation between two people can be described accord- 


ing to the successive processes of encoding, transmitting, and decoding 
messages.’ 


Stimulus and . Observable Filtering Observing 
and decoding psychologist 


Bio-social 
noise 


FIGURE 55. Communication model in which the organism is conceived as 
emitting signals to be received or interpreted by the observing psychologist. From 
Grant, 1954, page 65. 


The service to psychology of the communication model has thus far 
come largely through some special considerations relating to information 
in its quantitative aspects. 

Information is, of course, a very familiar concept. We receive in- 
formation when we get our questions answered; we use information when 
we find our way about; the dictionary, the telephone directory, the en- 
cyclopedia, are full of information. But when we turn aside from the 
rich substance of information to consider only amount of information, 
and to find a unit by which to describe the quantity of information, we 
enter the province of information-theory. The basic notion is that in- 
formation is discriminative; it answers the question: “Which of a num- 
ber of alternatives is this?” The more alternatives there are, the more 
information is conveyed by the answer. If you give a person’s name you 
give more information about him than if you give merely his sex, because 
through his name you have identified him among many alternatives, while 
by giving his sex you have identified him among only two alternatives. 


10 For parallels between physical communication systems and various problems of 
human intercommunication, see Schramm (1954), pages 3-26. 
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The relation of information to number of alternatives is given more 
formal expression: the amount of information yielded by specifying one 
of a number of alternatives increases as the logarithm of the number of 
alternatives. 

Where does this logarithmic relationship come from? The simplest 
way to look at it is that we can communicate information in a formal 
sense by reducing alternatives one half at a time. Suppose you were 
trying to guess a number between 1 and 32 and can ask questions an- 
swered only by “yes” or “no.” The most information you could get from 
your first question by an orderly approach would be whether or not the 
number was 16 or less, or above 16, thus reducing your alternatives by 
half. By continuing this process, you would arrive at your number. How 


many questions would you have to ask? It is not hard to prepare a little 


table to see how this works: 


Range Within Which Number of Guesses 
Number Lies Required 


We may say that specifying a number from among a larger set of 

numbers gives more information than specifying a number from among 
> . 

a smaller set. The relationship between our two columns can be ex- 


pressed as follows: 

Number of guesses required = 
model used for information theory. The unit of informa- 
which is a clever adoption of a familiar way of 
nts of information, combined with a conden- 
“pit” of information is the unit of informa- 
tion gained whenever the number of alternatives is reduced by one-half. 
If a message reduces the number of alternatives k to some fraction k/x, 
then the amount of information in the message is log,x bits. 

Information-theory deals with such problems as the disturbance of 
a message as it goes through a tiansgnssion System. The system con- 
tains random interference (“noise”) which requires that the message have 
Some “redundancy” in order that it may be Pissarro at a end. The 
English language is said to be about 50 per cent redundant. That permits 
a message a be somewhat garbled as it comes from the telegraph office 
acd ie understood. Because of noise the amount of unavailable in- 
formation at the end of a transmission system is greater than at the be- 


log, number of alternatives 


This is the 
tion is called the “bit,” 
talking about small amou 
sation of “binary digit.” The 
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ginning, suggesting a parallel to the physical notion of entropy. In the 
transformation of energy, some is always dissipated as heat, and no 
machine is 100 per cent efficient. The second law of thermodynamics 
states this principle as an increase in entropy. So, too, in communication 
some available information tends to be lost and unavailable informa- 
tion increases. 

Because learning involves input and output, with processes of dis- 
crimination, variability, and interference between stimulus and re- 
sponse, the information-theory model might presumably find useful ap- 
plications. This has indeed proved to be the case. 

There is a familiar demonstration experiment in psychology in which 
we show how much easier it is to learn a list of related words than a list 
of unrelated ones, and how, in turn, the unrelated words are easier to 
learn than a list of nonsense syllables. While the results may be stated 
quantitatively (showing a statistically significant difference between 
each pair of lists) the differences between the lists, as materials to be 
memorized, remain qualitative (“related,” “unrelated,” “nonsense” ). 
Here information-theory comes to our help because it is possible to 
show on the basis of the statistical characteristics of the English lan- 
guage the amount of information that has to be acquired in order to 
learn each list. A familiar word conforms to English language practice 
in following one letter by another in non-random fashion. (An extreme 
illustration is always following q by u, but a t is also followed by an h 
beyond chance, a j seldom followed by a c, and so on.) Nonsense syl- 
lables are constructed out of more nearly random sequences of letters, so 
that more information has to be acquired in learning them than in learn- 
ing meaningful words. Word sequences also have their probabilities in 
English, so that less information has to be acquired to learn “related” 
words than to learn “unrelated” ones. This is particularly true if the re- 
lated words form a sentence, thus capitalizing on non-random features of 
familiar sentence construction in English. 

It would be difficult to take the typical demonstration experiment 
as ordinarily used and make the necessary computations to assign diffi- 
culty in terms of the amount of information that has to be acquired to 
learn each of the three kinds of lists, but, in principle, it could be done. 


11 Information measurement can be interpreted in terms of variance and covariance. 
Large variance corresponds to a large amount of unassimilated information (i.e. ig- 
norance). Covariance or correlation corresponds to the relationship between input 
and output, that is, to the amount of transmitted information. These points are made 
by G. A. Miller (1955). 
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Closely related experiments, however, have been designed and per- 
formed. 

Miller and Selfridge (1950), for example, constructed materials with 
different degrees of approximation to the statistical structure of English. 
A random selection of words from the Thorndike-Lorge list of the 30,000 
commonest words provided the “sentence” most unlike English. This they 
called the zero-order approximation to English. The next, or first-order 
approximation, was a random selection from the words produced by the 
subjects in the higher-order sentences about to be described. Then fol- 
lowed a series of approximations each constructed in similar fashion. The 
next approximation (second-order) was obtained by constructing a “sen- 
tence” one word at a time by giving the first word to one subject who 
responded with a single word, then giving his one-word response to an- 
other subject who added one more word, and so on until the “sentence” 
was constructed. This procedure goes beyond the purely random list in 
preserving some of the word-association sequences of English. The next 
order of approximation gave as the context for response the two preced- 
ing words in the sentence. The procedure was followed through 7-word 
sequences, Finally, the authors selected a passage from current fiction or 
biography as a meaningful English text to determine Bow close the 
higher orders of approximation were coming to meaningful material. 
Results for passages of different length constructed in this way are shown 
in Figure 56. It is quite clear that as the passages come nearer to the 
structure of English they are more readily recalled. To a rough approxi- 
mation, the amount of information socalled was constant, Hence diffi- 
culty was in reality not a function of meaningfulness per se, but of 
quantity of information that had to be acquired. 

Inspection of Figure 56 shows that a 50-word passage at the fifth 
order of approximation is learned as readily as a fully meaningful pas 
sage of text, Are we to assume then that the fifth order of approximation 
is really meaningful text? Here is an illustration of such a passage: 
rn our living by working towards a goal for his team 


e wasn’t it even pleasant to talk about and 
ld not tell me the reason why you are is 


__ House to ask for is to ea j 
in old New-York was a wonderful plac 
laugh hard when he tells lies he shou 
evident.12 

It appears that there are kinds of nonsense as easy to recall as pas- 
a 2 à one 

sages taken from a novel. The nonsense is easy if it preserves the learned 
Structure of the language even though it remains nonsense. The struc- 

anguage, 


1? Miller and Selfridge (1950), pages 182-183. 
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ture can be treated quantitatively according to information-theory. Thus 
information-theory provides a powerful new analytic tool. 

Hovland (1952b) has furnished another illustration of the usefulness 
of the information-theory approach. He reanalyzed the concept forma- 
tion studies by Smoke (1932) (1933). Smoke had found that in the ac- 
quiring of concepts through presenting various embodiments of the 
concept along with irrelevant cues, “positive instances” in which the es- 
sential characteristics of the concept were included facilitated learning 
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FIGURE 56. Amount retained after equivalent practice with materials of differ- 
ent degrees of approximation to English. Miller and Selfridge (1950), page 181. 


TEXT 


much more than “negative instances” in which some characteristics es- 
sential to the concept were lacking. Hovland questioned the generality 
of Smoke’s conclusion because of two possibilities: subjects might be 
reacting to differences in information in positive and negative instances, 
or they might for some reason find it more difficult to assimilate the in- 
formation given by negative instances. If the second possibility is the 
true one, Smoke’s conclusion would be acceptable as an important gen- 
eralization about concept formation. If, however, the first possibility is 
correct, and negative instances furnish less information than positive 
ones, stimulus presentation would have to be blamed for the finding, and 
the generalization about negative and positive instances would have 
little psychological meaning. 

It should be noted in passing that these questions can also be raised 
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about some cases testing the positive and negative aspects of the law 
of effect, in which verbal “punishment” was sometimes found less effec- 
tive than verbal “reward.” It will usually turn out that saying Wrong 
eliminates fewer possibilities than saying Right. In that event the gen- 
eralization about the asymmetry between reward and punishment takes 
on new meaning. 

Hoyland’s analysis of the amount of information conveyed by posi- 
tive and negative instances in the concept experiment shows that the 
amount varies greatly. In general, as the total number of possible re- 
sponses increases, the minimum number of negative instances needed 
to convey the concept increases, even when the necessary number of posi- 
tive instances remains constant. In the most extreme case, a minimum of 
two positive instances sufficed to convey the concept, while 625 negative 
instances were required! 

The experimental test conducted by Hovland and Weiss (1953) 
ts do not behave as a machine would which 


showed that human subjec 
aws of information-theory. Even when the 


was built according to the 1 ey 
amount of information yielded by positive and negative instances was 


equated, they learned better from positive instances than from negative 
ones. While thus showing the simple information-theory model not suf- 
ficient to account for concept learning, there was some gain in having 
equated the information for positive and negative instances. There was 
at least some learning from negative instances, a result Smoke had not 
found. 

It is evident that in the studies by Miller and Selfridge (1950) and 
of Hovland (1952b), information-theory has provided a useful tool. Many 
other studies are currently appearing. They will help to define the range 


of this usefulness." 


The Theory-of-Games Model 
n 1944 of von Neumann and Morgenstern’s Theory 


behavior marked the beginning of a very rapid 
mong economists, of a type of model of po- 
fields of social science, including psychology. 


Jeans heavily upon concepts originating in 
try, branches of mathematics until recently 
> 


The appearance i 
of games and economic 
development, especially a 
tential usefulness in other 
The mathematics employed 
set theory and linear geome 
little studied by social scientists. 

formation (“bits”) have been used almost 


18 Thus far quantitative aspects of in tial aspects of communica- 

: f can tion of the stimulus. The sequentia! asp unica 
ton, inching ti intreting problems of encoding and decoding, are Just beginning 
to be brought into the psychological laboratory. 
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' As its name implies, the theory attempts to define the optimum 
strategy in competitive games, such as simplified poker games or modi- 
fied chess games. A player makes his moves against uncertainty, either the 
uncertainties of chance events (such as the order of the cards in a 
well-shuffled deck), or the uncertainties of the moves of the opponent. 
The game provides a simplified model of economic behavior, in which 
the decisions of an entrepreneur are made according to his predictions 
of an uncertain market and the moves of his competitors. The original 
theory-of-games model assumes that the players are thoroughly rational, 
and that they always make the best decision that the available informa- 
tion allows. In the game of tick-tack-toe, for example, it is possible to 
master the rational strategy, so that the outcome of the game is predict- 
able for expert players. But we know that in many instances the rational 
course is not chosen. We shall meet an illustration on page 390, in which 
it will be pointed out that in guessing against a 75:25 odds, subjects com- 
monly guess in the 75:25 pattern, while theory-of-games would easily 
show that hits would be maximized by always guessing on the 75 side, 
that is, guessing in a 100:0 pattern. 

In addition to the estimation of probabilities of outcome, the in- 
dividual playing a game prefers one outcome to another. In the language 
of theory of games, one outcome has higher utility than another. In other 
words, decisions are based upon value systems as well as upon prediction 
systems.* Utility and value are essentially motivational terms, and hence 


relevant to psychology. Some empirical investigations have been made 
to translate assumed utility systems into measure 
through individual behavior.1* 


A game usually involves more than one player, so th 


of games tends to be applied to group decisions, althou 
individual choice is involved.* Some le 


s of utility as expressed 


at the theory 
gh, of course, 
arning theorists, however, have 
attempted to bring game-theory into relationship with individu 
ing. A relevant example is that of Flood (1954) w 
“On game-learning theory and some decision-making experiments,” 
Flood attempts to show how a player can learn to improve his 
strategy during a sequence of plays. He accepts basic assumptions from 
both the von Neumann-Morgenstern game theory and the Bush-Mosteller 
learning theory to be discussed later in this chapter, Among the assump- 


tions is one that learning can be described by the special type of 
14 A very helpful elementary introduction to these notions is contained in Bross 
(1953). m 
15 E.g., Mosteller and Nogee (1951). 
16 E.g., Arrow (1951b). 


al learn- 
ho entitles his paper 
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stochastic process known as a Markov process, in which only the imme- 
diately preceding event is used in making predictions about the one that 
follows it. The author admits that this assumption will probably not 
prove adequate.” 

There is likely to be a greater interplay between the theory-of-games 
model and psychology within the next few years, but it is already quite 
clear that the model will have to undergo some modifications to move 
from the assumption of rational behavior to more appropriate assump- 
tions based on the actual behavior of individuals. This need is widely 
recognized. For example, Estes in reporting individual behavior in the 
face of uncertainty concludes that “in a simple decision process the 
human subject tends to behave in accordance with the principles of as- 
sociative learning and not, in general, in the most rational manner as 
‘rational’ is conventionally defined.” +$ Similarly, Hoffman, Festinger, 
and Lawrence (1954), in a study of coalitions in a competitive situation, 
are able to specify motivational changes affecting the coalitions in a 
manner to deny the symmetrical utility function required by game 
theory. Davis has taken the position that their situation is gamelike, but 
probably better treated as general psychology than as game-theory."* 
Davis’ position is little likely to lead to a unified theory of social behavior. 

These brief introductions to feedback models, information-theory 
of games, suffice as a background for the model- 
contemporary learning theories. There is a family 
all these models. They all deal with sequential be- 
havior, they all deal with uncertainty and decision-making. While these 
three models were developed in contexts other than psychology, it is 
clear that they are gradually influencing contemporary psychological 


theorizin g. 


models, and theory 
building activity of 
resemblance among 


CONTEMPORARY 
MATICAL MODELS IN 
pa LEARNING THEORY 


We | Iready seen in a preliminary way, how mathematics has 

have alrea 4 ii ve-fitti 

been used in relation to leaming data. Empirical = gias Aans Ms 

loosely defined model: merely a family of l -p an 

Convenience and without theoretical justification. hen ~ a 

are bett ftt ] by a curve from a different family, the old model is 
etter ntted by 4 


« interpretati 
17 Markov processes were firs oned a me te iter m 
and Frick ( 1949 ), and they are assumed in mos! 
is Estes (1954a), page 136. 
° Davis (1954), page 15. 


ion of learning by Miller 


t menti athematical treatments. 
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abandoned in favor of a new one. Rational curve-fitting corresponds more 


closely to contemporary model-building, for the nature of the curve is 
fixed by the theory, and predictions follow. 


Models Based Largely upon Differential Calculus 


The success of calculus in handling so many problems of physics 
and engineering has made it the standard fare of most college mathe- 
matical sequences. Because calculus deals with rates of change, it 
to be an appropriate mathematics to try out in relation to pro 
learning. 

Models depending largely upon differential calculus have been 
extensively used by Rashevsky and his colleagues and students in studies 
over many years. While the models are usually referred to the nervous 
system, little of neuroanatomical detail is involved, so that the model is, 
in fact, more largely mathematical than anatomical or physiological. 
Despite the ingenuity displayed, and the mathematical sophistication, 
these models have had relatively little influence within psychology. 
Their neglect must be due only in part to the inability of psychologists 
to follow the mathematics, for other difficult mathematical models have 
been understood by at least a few psychologists. Some 


of the neglect may 
have been due to the lack of interest in neurophysiologi 


ical speculation by 
learning theorists. Possibly more important is the social fact that no one 
has attempted a consistent experimental program tied to the systematic 
developments. It is not enough that the mathematical Consequences satis- 
factorily fit old data. For psychologists to become interested, new and 
relevant data must be predicted. The failure of anyone to pick up the 
threads of the Rashevsky theories may be merely a kind of historical 
accident, for the possibilities appear to be there.?° 

We saw in Equations (7) and (8), page 373, how Th 
gan by setting up differential equations and then throu 
arrived at his learning-curve equation. Many models beg: 
To illustrate the naturalness of this approach to the co 
mathematical model, we may consider a very 
proposed by Simon (1954). He called it a “Berlitz model,” after the 
self-instruction method of learning a foreign 


language, He makes the 
following assumptions about a person Starting out to learn French matty 
method: 


appears 
blems of 


urstone be- 
gh integration 
in in this way. 
struction of a 
sketchy mode] recently 


*° For a review making the Rashevsky models somewhat more accessible, 
man (1954). These ae deal with social behavior, rather 


than with | Pie er 
earni 
the review reveals the type of logic employed. rning, but 
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The individual is free to practice as much or as little as he wishes. 

As he practices, the activity becomes easier for him. 

. The pleasantness of the activity increases as it becomes easier for him. 
The more pleasant the activity, the more he engages in it; the less pleas- 
ant the activity, the less he engages in it. 


popH 


It takes very few additional assumptions to express these verbal 
statements in a mathematical model. 

First, the rate according to which difficulty decreases as practice 
continues can be expressed by the following differential equation: 


dD 
a = —aDx (11) 


where D = level of difficulty at any one time 
x = rate at which practice takes place 
u = a constant based on the learner’s capacity to learn 
t = elapsed time 


This means, in words, that the rate of change in difficulty decreases 
as the product of the difficulty level at the time and the rate of practice 
at the time. With this equation as the starting point, we will come out 
with a logarithmic decrease of difficulty with practice when the differ- 
ential is integrated. Thus the statement that practice reduces difficulty 
has become more precise as we use à mathematical model instead of the 
verbal one; whether or not it is true remains a matter for empirical 
testing. 

A second differential equation expresses the motivational assumption 
that the task becomes more pleasant with practice, and that, as it becomes 
more pleasant, there will be a tendency to practice more: 


_& _ <i (¢ —x) (12) 


where x = rate at which practice takes place 
z — satiation level of practice for any level of difficulty 


Less practice is pleasant, more practice is unpleasant 
—b = a constant based on the learner’s motivation 


t = elapsed time 


Put into words, this says that the amount of practice will decrease 
proportionately to the amount that the rate of practice exceeds the 


satiation level. 


Even as simple a m eeu 
practice and difficulty, provided the original values of difficulty (Ds), 
), and the satiation level (x) are known. 


odel as this permits predictions of time paths of 


the initial practice rate (£o 
Here are examples of some common-sense predictions based on the 
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model. These predictions follow strictly, however, from the mathematical 
implications of the model: 

a. If the initial values of D, and x, are very high, it can be predicted 
that the decrease in x will be more rapid than the decrease in D, so that 
the student will give up his practice before learning French. 


ae 57. Some implications of Simon’s “Berlitz” model. Simon (1954), page 
b. With intermediate values of D, and Xo there will be temporary 
discouragement and slowing up of practice, because practice of difficult 
material is unpleasant, and unpleasantness slows practice. The mathe- 
matical model shows, however, that there is a critical point, and if difi- 
culty falls below this point before practice stops, practice will become 
pleasant and learning will proceed to mastery, 
c. If difficulty is low, and initial rate of practice 
according to this model, will proceed smoothly. 
These three conditions are illustrated in the cur 
Even the student entirely unfamiliar with c 
a certain naturalness in the attempt to fit this model to a learning situa- 
tion. The model is proposed by Simon for purposes of illustration, and he 


does not think of it as sufficiently developed to be used to desi gn em- 
pirical research. 


also low, learning, 


ves of Figure 57, 
aleulus will perceive 


Stochastic Models 


Psychologists have long been familiar with 
underlying psychological measurements. 
the possibility of extending probability c 


probability concepts as 
They have also recognized 
oncepts to sequential data of 
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the sort found in learning. Thus Thurstone (1930a), with a background 
in psychophysical measurement, found it appropriate to use probability 
concepts in developing his rational curve equation, although the mathe- 
matical procedures used were those of differential and integral calcu- 
lus. 

The kind of model based on probability mathematics rather than 
upon differential calculus ** has come to be called a stochastic model. 
A stochastic process is defined as a sequence of events together with the 
probabilities of these sequences. Thus for the older notion of a casual 
chain we now have the related concept of stochastic process—a chain 
with uncertainty at every link. These uncertain but possible events at 
each link are expressed according to the mathematics of probability. 
Perhaps the Tolman and Brunswik (1935) expression “causal texture” 
can be appropriately substituted for “causal chain” when we deal with 
stochastic processes. 

We shall turn now to two illustrations of stochastic models within 


learning theory. 


Estes’ Statistical-Association Model 


Estes (1950) presented a statistical model with psychological as- 
sumptions very similar to those of Guthrie. As general background theory 
he assumed that the stimulus complex consists of a “finite population of 
relatively independent environmental events.” Only a sample from this 
population is effective at any given time. Responses, too, fall into classes, 
and response instances occur and enter into experiments as the dependent 
variables. To develop a probability model, response probability has to be 
defined. Estes defined response probability as the average frequency of 
occurrence of members of a response class relative to the maximum 
er the conditions of the experiment. The basic 
that association occurs through contiguity. That 
is, if a stimulus element is present once along with a response occurrence, 
the association is formed at full strength through this single occurrence. 
Response probability for the class of responses being learned increases 
with practice because more stimulus elements become conditioned to it 
on succeeding trials, and more elements become alienated from their 
association with other response classes. This is, of course, Guthrie’s 
theory, expressed by Estes in mathematical form. These psychological 


possible frequency und 
learning assumption 1s 


is not used in the course of developing thes 

21 Thi t mean that calculus is no ed ping on 
gon cee not avoid calculus in probability theory any more than you avoid 
algebra i. icd: The main point is that calculus deals with regularities, while 


probability deals with measures that contain some degree of uncertainty. 
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assumptions suffice to permit equations to be written, predictions made, 
and agreements or disagreements with data determined. 

The theory first proposed by Estes (1950) has been refined mathe- 
matically and applied to new problems by Estes and Burke (1953) and 
Estes (1953) (1954a). 

Let us consider, first of all some data to which the developed model 
is appropriate. The experiment is one by Grant, Hake, and Hornseth 
(1951), extending a kind of experiment introduced by Humphreys 
(1939b). 

Two lights are mounted on a board before the subject. The subject 
is asked to guess, while the left light is on, whether or 
will flash on. The guess is recorded, and the subje 
not his guess was correct. The experiment was s 
different groups of subjects the percentages of s 
The percentages of trials on which the second |] 
0, 25, 50, 75, and 100 per cent, Results are plotte 
that after 60 trials subjects tended to b 
age at which the lights were flashing. 

Although these results appear plausible enou 
will show that they are not the results that 
decision-maker, as implied, say, 
which lights come on 75 per cen 
the time. Any gambler knows t 
will occur rather than another, 


not the second light 
ct notes whether or 
o arranged that for 
econd flashes varied, 
ight flashed on were 
d in Figure 58. Note 
e guessing at about the percent- 


gh, a little reflection 
would be given by a “rational” 
in game-theory. Consider the cases in 
t of the time and are off 25 per cent of 
hat if the odds are 3:1 that one event 
once he detects these odds he will maxi- 
mize his hits by always betting on the more frequent event. Yet the 
subjects somehow “preferred” to match the stimulus probabilities with 
their guesses rather than to maximize their hits, M 
ments have yielded similar results,22 


We need to examine a little more carefully the sense in which 


Estes’ theory is a contiguity theory similar to Guthrie’s. The events in 


this light-guessing experiment are conceived by Estes to paralle] classic. 
conditioning as follows: 


any similar experi- 


al 


1. Conditioned stimulus = signal light (left | 
2. (Conditioned response = saying “on” or ‘ 
going to happen on the right.) 


he eye following 50 
me asymptote for their 
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3. Unconditioned stimulus = right light “on” or “off.” 

4. Unconditioned response = saying “on” of “off” according to what hap- 

pens on the right. 

In a contiguity theory the conditioned response is unimportant ex- 
cept as an indicator of the state of the learned-response system, and as 
furnishing some of the components of the stimulus situation present 
when the unconditioned stimulus and response occur. The conditioned 
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F 3 58. Positive responses when a first light is followed by a second light 
IGURE ‘58. tapas of the trials. Note that the limit approached tends to corre- 

times that the light appears. The trials are numbered in 
ake, and Hornseth (1951). 


in specified percent: 
spond to the percentage of 
blocks of five. After Grant, H 


response as such is not important in determining the terminal events that 
are responsible for what is learned. The contiguous items in Estes 
system are the conditioned stimulus, the unconditioned stimulus, and the 
unconditioned response. The conditioned response (the “guess” as to 
whether or not the light is coming on, or which light in some cases) does 
not enter the equations as a modifier of the influence of the unconditioned 
stimulus, Thus there is no “confirmation” of a correct guess and no “cor- 
rection” of an erroneous one. All that really matters is the occurrence 
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or non-occurrence of a light and the verbal response to it in the presence 
of cues (the effective conditioned stimulus). Thus the interpretation is 
strictly a contiguous association one, without reinforcement or extinc- 
tion of a prior response, as in reinforcement-type theories, 

Now to return to Estes and Burke. We shall sce something of the 
construction of their theory in relation to the experiment previously 
described. In this we follow Estes (1953). 

Let E, be the case when the second light comes on and E, the case 
when the second light fails to come on. Let pi(n) be the probability 
that E, will be guessed on trial n, and p,(n+1) the probability that 
E, will be guessed on trial (n +1). The student unfamiliar with proba- 
bility notation will have to be on his guard not to interpret all paren- 
theses as signifying multiplication, as they commonly do in algebra. The 
parentheses after p,, here do not imply multiplication, Unfortunately, 
mathematicians expect you to know their conventions, which are no 
more consistent than the conventions of orthographers in other languages, 

The theory assumes that every time the right light comes on and is 
responded to by “on” (ie, whenever the unconditioned stimulus and 
unconditioned response occur) there will be an increase in the proba- 
bility of responding to the signal (the conditioned stimulus) by guessing 
“on” (the conditioned response). That is, if E, occurs on trial (n) the 


increase in probability of its being guessed on trial (n+1) can be 
stated according to the equation: 


pi(n + 1) — p, (n) = Ol — p,(n)] a3) 
where @ is a fraction between zero and one. 


ase in probability per trial 
is a constant fraction of the amount remaining to be learned, 1 — p(n 


perfect responding (at a probability of 1) 
and the present probability, P,(n). This will obviously yield the same 
form of learning curve as that adopted by Hull (see page 372). 


Now if E, (absence of light) occurs, the probability of Predicting 
E, diminishes because by association theory it is t 


that should be favored on the next trial. This diminution of prob 
is expressed by the following equation; 


Pi(n +1) —p,(n) = — 6p, (n) 

These two equations are known as operato 
They merely state in precise form the assump 
repeating one of a pair of alternative respons 
cases and decrease with negative cases. The c 


(14) 
r or difference equations. 
tion that Probabilities of 
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mined empirically, but once determined it should hold through many 
variations of the experiment. 

The fraction of trials that the light comes on (when E, occurs) can 
be expressed as m (Greek letter pi), and the trials on which it is omitted 
(when E, occurs) as (1 — ~). We know what happens when E, occurs 
from Equation (13) and what happens when E, occurs from Equation 
(14). Because we know the relative frequencies of E, and E,, we can 
combine Equations (13) and (14) directly: 


p(n +1) —p,(n) = x(6[1 — p,(n)]) + (1 — +) [—6p,(n)] (15) 

This equation looks difficult to the mathematical novice because of 
the parentheses and brackets, a subscript, some Greek letters, and un- 
certainty as to which parentheses are really subscripts and which mean 
multiplication, Anyone with high school algebra can carry out the 
multiplication and combine terms. When this is done, the following 
equation results: 

p(n +1) — p, (n) = blr — p,(n)] (16) 

Comparing Equation (16) with Equation (18), we see that m has 
replaced 1. In other words, the limit of learning has changed from 1 to z. 

Equation (16) gives us the expected change of p, from any one 
trial to the next. Now we skip some mathematical steps leading from 
Equation (16) to a derived learning-curve equation. It can be shown 
rigorously that beginning with trial 0 and going to trial n, we have as the 
probability of guessing E, on any trial n: 

p(n) =a — [r — p,(0)](1 — 6)" (17) 

This is a derived learning-curve equation, and in form it looks very 
much like many others in the literature, for example, that of Hull, 
Equation 4, page 372. But the constants have a special meaning in this 
case. Just as Hull’s curve approaches M as a limit, so this curve ap- 
proaches ~ as a limit. This follows because the second term in Equation 
(17) becomes negligible for very large values of n. But ~ is the relative 
frequency with which E, occurs. 

Now we come back to our experimental findings. The mathematical 
theory has predicted exactly the result found by Grant, Hake, and Horn- 
seth (1951), and other experimenters, for the asymptote of the guesses 
in these experiments tends to be ~, the relative frequency with which 


the second light comes on. l 
While this is a most incomplete account of Estes’ theory, it serves at 


least as an introduction to it. 
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Bush and Mosteller’s Linear-Operator Model 


Bush and Mosteller (195la) were the first to propose a linear- 
operator model for learning theory, although such operators were later 
“incorporated into the Estes and Burke model just described. We shall 
see presently how mathematically similar the two models are, but first 
we may examine the similarities and differences in psychological as- 
sumptions. 

Bush and Mosteller make essentially the same assumptions as Estes 
with respect to stimulus and response, Their assumptions are stated 
with particular clarity in their second paper. In this paper they use- 
the notation of “set theory” in mathematics, For both Estes and Bush 
and Mosteller the organism perceives (or reacts to) some fraction of 
the total stimuli available. Whereas Estes describes this fraction in 
terms of the proportion of elements in a totality, Bush and Mosteller 


speak of the “measure” of the elements. The “measure” is a kind of sum 
of weighted elements, tes’ number of 


used to derive 
ychological as- 


and serves the same purpose as Es 
elements. The set-theoretic model in this second paper is 


the linear operators of the earlier paper. Because the ps 
sumptions are unchanged in this more elegant mathematical treatment, 
we return now to the original psychological assumptions, 

The basic difference between the psychological assumptions of 
Bush and Mosteller and of Estes rests on interpretation of reinforcement, 
Estes follows a contiguous association theory, not requiring the opera- 
tion of reinforcement or of extinction as fundamental principles, Bush 
and Mosteller adopt principles of learning more like those of Hull and 
Skinner, accepting one parameter (a) for the positive influence of rein- 
forcement, and another parameter (b) for the negative influence of 


punishment or of work required in responding, that is, a work inter- 
pretation of extinction. 


Their basic operator, in their notation, is; 


Qo =p +a(1—p) — bp (18) 
where Q, = the operator Q Operating on p, the probabili that 
i will occur during a specified fis p y hatia response 
a = parameter associated with reward 


b = parameter associated with punishment or work required 
p = probability of response 


This is the linear operator that gives the model its name.*4 


23 Bush and Mosteller cee n 

*4 We may rewrite Equation (18) in a form to show more clear} iti P 
function: Q, = (1 — a — b)p +a. Because (ise baud (a) are eat i linear 
equation is in typical linear form, eg., y =ax +b, mts, the 
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We may define the isolated effect of reinforcement by setting b=0 
in Equation (18): 
l Q= p+ all —p) (19) 
Similarly, the isolated effect of extinction can be studied by setting 
a=0 in Equation (18): 
Q p=p-bp (20) 
Now we have two equations from Estes, Equations (13) and (14) 
and two from Bush and Mosteller, Equations (19) and (20). How are 
these related? In the case of flashing lights we may assume that the 
effect of saying “on” to a light that has appeared will affect the next 
performance in an amount equal to the effect of saying “off” to a light 
that has not come on (although, of course, the effects are in opposite 
directions). That is, we may set the parameters all equal, a= b = 6. 
Then we can show that the Estes and the Bush and Mosteller equations 
are equivalent. They may be written in parallel as follows: 


p(n +1) =p,(n) + oll — p,(n)] From Equation (13) 

Q, =p + oll — pl From Equation (19) 
1 

n + 1) =p,(n) — 6p, (n) From Equation (14) 

a ) d i = ap From Equation (20) 


The only remaining differences are in notation. 

This coincidence of two models based on quite different psycho- 
logical assumptions is significant, but the case illustrated here is a 
special one, without risk, with the assumption of equal consequences 
of alternative occurrences, and without any assumption of work involved 
in responding. When these several assumptions are not met, the two 
models may yield different predictions. 

Let us now look at a single illustration of the Bush and Mosteller 
model in use. Consider the course of learning in a Skinner box. One 
measure of learning is the total number of responses to extinction follow- 
ing various numbers of reinforcements. Data are available from the 
work of Williams (1938) for responses after 5, 10, 30, and 90 reinforce- 
ments. These are plotted in Figure 59. The smooth curve is the one pre- 
dicted from the Bush and Mosteller model, but we need to retrace the 
steps to see how it was arrived at. i i 

Beginning with the operator in Equation (18), the change of rate 
of responding with time can be computed on the assumption that, start- 
ing with any probability p, the rate will increase according to this proba- 
bility changed by the operator. According to the mathematics of differ- 


ential and integral calculus, appropriate to changes of rate in a free- 
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response situation of this kind, an expression can be derived for the 
maximum rate of responding.” This turns out to be: 


w 
Vmax = 14 p/a (21) 


where Vmax = rate of responding with infinite time 
w = maximum possible rate of responding when p = 1; “activity 
level” 
b/a = ratio of the constants from the operator 


What does this maximum mean? The final rate of responding in- 
creases with a, hence with the amount of reinforcement, and decreases 
with b, the amount of work required in responding (or other negative 
influences). Under most circumstances the maximum possible rate (w) 
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FIGURE 59. Total extinction responses as a function of number of reinforce 
Data from Williams (1938). Smooth curve as derived from Bush and Mostel 
learning model. After Bush and Mosteller (1951 


a), page 320. 
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is missing from the Estes-Burke model, which does not 
the work required in responding. 
By similar mathematical 
tion for the total number of 
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xtinction, 
25 Details of the derivations here summarized very sketchil i 
Mosteller (1951a), pages 817-319, bg chily are found in Bush and 
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provided only the rate of responding at the beginning of extinction (Ve) 
is known. This turns out: 
J 
mr = — log Ve 


bu’ V; (22) 


where my = total responses to a criterion of extinction 
b = constant from the operator 
V, = rate of response at beginning of extinction 
V; = final rate at “end” of extinction (i.e., criterion for empirically 
“completed” extinction ) 


Equation (22) fits the data of Skinner-type extinction curves. But 
we wish to go one step further in fitting the Williams data of Figure 59. 
We must express the total number of extinction responses as a function 
of n, the number of reinforcements. 

The original operator, if repeatedly applied, produces a curve of 
acquisition very similar to that of Estes and Burke in Equation (16). 
In the notation of Bush and Mosteller we have: 


Q'p = BN pe a —p |(l1—a-— b)” (28) 
} a+b a+b 2 
where Q"p = probability of response after n reinforcements. 
a, b = the constants from the operator 
n = number of reinforcements 
p= probability of response at the beginning of reinforcement 
A transformation involving some additional steps that need not con- 

cern us rewrites Equation (23) in terms of rates of response at the be- 
ginning (V,), when extinction is undertaken (V,), and at the maximum 


(Vinax) 


Ve = Vmax — (Vmax — Vo) (1 — a — b)” (24) 


Note that to produce identity between the Estes Equation (17) and 
the Bush and Mosteller Equation (24) we need only make the follow- 


ing assumptions: 


V maxs = 7 
a=0 . 
b = 0 (for Estes does not consider the work factor) 

We return now to the problem of fitting the Williams data. The 
main relationships are contained in Equations (22) and (24). The 
second of these relates the rate of responding at the time extinction is 
begun to the number of prior reinforcements, while the first predicts how 
many extinction responses will follow this level of responding, All we 
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need to do is substitute Equation (24) into Equation (22) and we 
have our required equation. We have: 


7 
m= poef g [i] es 
Inspection of the Williams data led Bush and Mosteller to set 
Vmas/Vs = 5, and V, = V;. After this gross empirical step, the constants 
a and b were determined by curve-fitting, yielding a = .014 and b = .026, 
Substituting these values, Equation (25) reduces to the following: 


m= 18 | 5- tase)» | (26) 


It is now clear that we have an equation in which mmr, the total 
responses to extinction, can be computed by substituting a value for n, 
the number of reinforcements. This is the computation that produced 
the points from which the smooth curve of Figure 59 was constructed. 
The fit, as we see by inspection, is satisfactory. 

This one specimen can serve as the barest introduction to the Bush 


and Mosteller models, which have recently been published in book 
form.?° 


Relative Success of Stochastic Models 


The models of Estes, Estes and Burke, and of Bush and Mosteller have 
been quite successful in predicting the course of events within lever- 
pressing problems, simple runway problems, discrimination 
to uncertainty. Data have been derived from both anim 
experimentation. 
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26 Bush and Mosteller (1955). The book had not yet a : P 
was written. yet appeared when this chapter 
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equations. But the speed of light can be measured, and thus can give 
independent justification for identifying the constant with the speed of 
light. If Estes’ constant @ represents some fraction of the total stimulus 
elements effective at once, there ought to be some way of demonstrating 
this directly. Similarly, Bush and Mosteller’s constants a and b ought 
to have independent justification through quantitative studies of motiva- 
tion and work, independent of learning. But this more searching analysis 
of the theories can wait until the success with interchangeable constants 
is demonstrated. 

Mosteller (1953) has presented a table showing the value of Estes 
and Burke’s @ as derived from a number of symmetrical two-choice 
situations, resembling the experiment of Grant, Hake, and Hornseth 
previously described, but also including some experiments arranged as 
a gambling set-up, in which the subject had to pay to play, and thus had 
additional motivation for desiring to win. Mosteller’s table is reproduced 
as Table 6. 

Two things need to be said about this table: 

1. Of 22 determinations with rats, high school students, and college 
students, under a variety of conditions including many pay-off fre- 
quencies, the value of 8 is either .08 or .04 in 14 cases, and only four cases 
lie outside the range of .02 to .05. This is rather interesting agreement, 


considering that all the data are fitted in accordance with a single param- 


eter. 
A word of caution is in order. At the asymptote m (or Vmax) the 


term that includes @ (or a and b in the other system) approaches zero, 
so that the prediction of the value of the asymptote is independent of 6. 
The value of @ specifies the rate at which the asymptote is approached, 
ange in @ may make a great change in the number of 
o approach the asymptote. For example, let us set the 
quantity (1 — p = Ql. Then by substituting values of @ and solving 
for n we can see what difference 0 makes in the number of trials required 
to approach to within a standard distance of the asymptote. By this cal- 
culation the range of trials required to approach the asymptote with a 8 
of .05 is 90 trials, whereas with a @ of .02 the corresponding value is 
228 trials, Hence very small changes in 0 may produce significant differ- 
ences in the number of trials required to reach some criterion of 


mastery.” 


and a slight ch 
trials required t 


cs as H. Lawrence, who called my attention to the importance of small 
ae ee points out that 8 varies more widely if the vsa method of least 
squares TENE in favor of the method of maximum likelihood in fitting curves 
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$ ic Two-Choice 
6. Value of Estes’ @ Computed from a Number of Symmetric 
— Situations. After Mosteller, 1953, page 42. 


Pay-off Frequencies Experimenter, Subjects, Conditions Value of 8 


100:0 Stanley (rats in T-maze ) 04 
Jarrett (Harvard students ) 
—-pay to play -ll 
—free play 05 
50:0 Stanley (rats in T-maze) 04 
Jarrett (Harvard students ) 
—pay to play 04 
—tree play 02 
Robillard (students) 
—rewards of 0-cents 04 
—rewards of l-cent 03 
—rewards of 5-cents 03 
Cards (high school students) 04 
(Harvard students) .07 
75:25 Stanley (rats in T-maze) 04 
Jarrett (Harvard students ) 
—pay to play 03 
Estes (students ) 04 
Jarvik (students) 05 
100:50 Jarrett (Harvard students) 
—pay to play .07 
80:0 Robillard (students) 
—no rewards .06 
80:40 Robillard (students) 
—no rewards 03 
67:33 Jarvik (students) .03 
60:40 Jarvik (students) .02 
60:30 Robillard (students) 
—no rewards 03 
30:0 Robillard (students) 
—no rewards 03 


2. The four cases in w 
the rest are psychological 
for theory. 

For example, in the 100:0 problem it m 
or not the Harvard students had to pay one 
in order to win one cent. Also the fact t 


hich the parameter disagrees strikingly with 
y interesting, and pose important problems 


akes a real difference whether 
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hat 100:50 has consequences 


imum likelihood is mor 
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ts the constancy of 9, 


e appropriate than the 
roach to the asymptote. 
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different from 50:0 is psychologically interesting. From a probability 
point of view these are alike (provided we weight positive and negative 
cases equally): certainty on one side and a 50:50 chance on the other. 
Yet they are unlike in the pattern of responses they call out from the 
subject. These discordant results show that some supplementary as- 
sumptions are needed before the theories can be considered completely 
general. 

The models described are, of course, unfinished ones. The original 
model-builders are carrying on with them, and others have come along 
with new suggestions. For example, Restle (1955) has presented a 
variation of the model in which @ is interpreted as the proportion of 
relevant cues.** This turns out to be a simplifying assumption, so far 
as discrimination is concerned, and leads to some satisfactory predic- 
tions. Restle’s work comes nearer to providing an independent measure 
of what is essentially Estes’ 8, and shows circumstances under which 6 


values change. 


THE PLACE OF MATHEMATICS IN LEARNING 


With the advance of psychology as a science there will inevitably 
be an increased use of mathematics. Mathematics is the language of 
measurement, of precision, of prediction. Yet there is no royal road to 
scientific achievement and mathematics must remain subservient to 
observation and reflection. Instruments of precision in the laboratory are 
also the means of scientific progress, but we all know that in the wrong 
hands they may lead to narrowness and triviality instead of to advance. 
Therefore we need to examine mathematics critically as a powerful tool, 


but not a magical one. 


The Improvement of Data 

The relationship between theory and data is circular: without 
theories we do not know what data to look for, and without good data 
we cannot test our theories. 

We have been greatly helped in the improvement of our data through 
that branch of applied mathematics known as statistics. We have learned 
how to test the significance of differences, how to express reliability 
and validity of scores, how to use correlation as a measure of relation- 
ship, and factor analysis to comprehend a table of intercorrelations. We 


28 The concept of relevant and irrelevant cues is discussed by Lawrence (1950), with 


experimental justification for the distinction. 
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have gone ahead to learn more about the design of experiments so that 
results can be secured both economically and in a form suitable for 
statistical treatment. Statisticians have helped us also in the construction 
of scales by which data can be ordered, and we have learned to dis- 
tinguish between interval scales and ratio scales. 

These services of statistics are reflected throughout the experimental 
study of learning. Because through the better application of st 
methods the yield of data is better, the learnin 
needs satisfactory data if he is to test with an 
from his theories. 


atistical 
g theorist is helped. He 
y precision the predictions 
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name, being associated with 
in contact with data, is es- 
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so the contemporary mathe- 
s to bring order out of exist- 
uggest new experiments that 


een summarized by Estes (1953) 


1. Information incor 
diction. 

2. The results of different experiments often become 
comparable when referred to the same formal model. 

8. The quantitative analysis of data may be facilitated by the model, 
Hidden or confounded variables can often be unraveled if the mathe- 
matical model is a good one. . 


porated in a model is in a form suitable for pre- 
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4. What is the probability that a man who has failed on trials 7, 8, 
and 10 will succeed on trial 14? 

He points out that routine curve-fitting would require new calcula- 
tions direct from data in order to answer each of these questions. He 
concludes: “Thus, in a reasonable sense the few constants plus the model 
can contain nearly all the useful information in the original data.” 

The purpose of the model is clear enough. It provides a formal 
theoretical structure into which data can be fitted, and it permits the 
statement of testable scientific propositions about the relationships 


among the data. 


Some Cautions 

The fascination and promise of mathematical theories are so great 
that a word or two of caution may be appropriate. That is, we need to 
keep all developments in proper perspective. It is not a question of being 
for or against the development of mathematical models. They should 
most certainly be developed, but in their appropriate relationship to 
other aspects of science. The following statements call attention to some 
of the considerations that ought to be kept in mind as mathematical 
theories are evaluated. 

1. The history of science tells us that some of the most important 
developments in the relation of mathematics to science came about with 
very simple mathematics. Thus chemistry and atomic theory generally 
were enormously advanced by the law of combining proportions (Gay- 
Lussac’s law), which stated simply that the combining volumes of gases 
entering into compounds could be stated as simple whole numbers. 
Thus the ratio of hydrogen and oxygen combining to form water is 2:1. 
What mathematics could be simpler? Another advance in atomic theory 
was Mendeleev’s periodic arrangement of the elements. He noted a 
kind of periodicity in which, with elements arranged in order of in- 
creasing atomic weights, certain elements resembled earlier ones in 
valence and other properties. Again, the simplest of mathematics, with 
the most profound results. Or, to choose another field, consider Mendel’s 
famous ratio of 3:1 for the types showing up after a given kind of mat- 
ing. Here is the background for the modern science of genetics. What 
these examples suggest is that scientific advance comes when the right 
concepts are discovered. The right concepts may prove to be beauti- 
fully simple and capable of very simple mathematical expression. The 
complicated mathematics becomes more useful later, after work goes 


on within the right concept system. 
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This is not, of course, a complete history of science, and sometimes it 
has required real mathematical genius to bring scientific advance. Had 
Kepler not been a skilled mathematician he could not have brought order 
out of the very complex mass of data left by Tycho Brahé. When by a 
series of curve-fitting attempts he found that he could fit the path of 
Mars with an elipse, his “model” created a kind of order until the more 
satisfactory “model” of Newton succeeded his. The fact remains, how- 
ever, that the search for the most appropriate concepts by which to 
harmonize scientific data often may not require complex mathematics. 

2. The verification of a scientific theory is a slow process, and a 
few successes do not suffice to assure the enduring value of the theory. 
As we have seen, theories starting out with different assumptions may ac- 
count for some of the same data. As long as competing hypotheses are in 
the running, further tests are needed to permit choice among them. A 
really poor model gets rejected early, but a plausible model may survive 
for some time before it is discarded in favor of a better one, 

3. A model is appropriate to a limited range of phenomena at most. 
This fact is ignored in two ways. The first way in which the limitations 
are ignored is by the model-builder who is tempted to generalize the 
applicability of his model beyond the data to which it is appropriate. 
This is all right, provided he is as cautious in his extrapolations as he 
is in his model-building, but the temptation is great to argue by analogy 
without testing the analogy as carefully as it needs to be tested. The 
second way in which these limitations are ignored is by the critic who 
brings in as a refutation of the model some data that the model does 
not propose to handle,*? 

4. Models starting with unlike assumptions may be complemen- 
tary rather than competitive. A model such as that of Pitts (1943), which 
follows the Rashevsky pattern in making assumptions about the nervous 
system as the basis for a model of conditioning and learning, may comple- 
ment a model such as that of Bush and Mosteller that is concerned with 
events going on at the level of learning data. In other words, it ma 
turn out that more than one model is “true,” in the sense that it fairly 
represents reality when that reality is abstract y 


ed in a certain way. 
5. Mathematical models are not foolpro 


of, even when the mathe- 
matics used is elegant and accurate. Mathematical argum 


f Sane ents have to be 
scrutinized for fallacies just as verbal arguments do, It is usually easier 
to detect the error in mathematics than in verbal stat 


, : ement, but not 
always. The mathematical system is self 


-contained, but its appropriate- 
29 This point has been made by Spence (1952b). 
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ness for a given set of data always involves assumptions, some of which 
may be far from obvious. For example, more than one process may be 
symbolized by a common parameter. We have seen how the parameter b 
in the Bush and Mosteller model stands for inhibitory processes such as 
punishment and work. But this is a large order! Whenever b appears in 
an equation it is appropriately multiplied or divided or squared, but if 
punishment and work are not processes that act alike, this parameter 
does not fully represent them. Estes’ model is supposed to get along 
without assumptions about reinforcement or extinction, yet the operators 
distinguish between success and failure in a manner that looks suspi- 
ciously as though some kind of “feedback” is implied. The mathematical 
model is not going to save us from theoretical arguments, although it 
may perhaps save us from foolish ones. 

6. Because mathematical models are useful ones, it does not follow 
that they will pre-empt the field, and that no other kind of theory will 
be worth listening to. Guthrie’s theory, which provides the background of 
assumptions for Estes, was derived without benefit of mathematics. 
Freud’s various theories, possibly the most influential of all in the psy- 
chology of the twentieth century, were developed by observations un- 
aided with either instruments or mathematics. While mathematical 
models deserve prestige, they need not lead to the neglect of other kinds 


of theorizing. 


SUPPLEMENTARY READINGS 


BOOKS 
Relevant mathematical models f' 
Arrow, K. J. (1951) Social choice and individual values. 


Bross "7, (1953) Design for decision. 
Se oe E (1951) Methods in operations research. 


RASHEVSK . (1938 Mathematical biophysics. l 
Bereikt Ne D A W. (1949) The mathematical theory of communi- 


rom non-psychological fields 


deinen T G (1955) Automatic feedback system synthesis. 


VON NEUMANN, J and MORGENSTERN, O. (1947) Theory of games and economic 
ANN, Jo 


behavior. y 
Wainer, N. (1948) Cybernetics. 
in psychology and learning theory 
55) Stochastic models for learning. 
H. D. (1945) Mathematical biophysics of 


Mathematical models 


Busu, R. R., and MOSTELLER, F. (19 
HousenoLpER, A. S., and LANDAHL, 
the central nervous system. 


Hutt, C. L., and others (1940) Mathematico-deductive theory of rote learning. 


406 THEORIES OF LEARNING 


5 i inking in the social sciences. 
P. F., ed. (1954) Mathematical thinking int jal 7 
PN e H., and Davıs, R. L., eds. (1954) Decision processes. 


SHORTER INTRODUCTIONS 


i i i i In D. Lerner 
K. J. (1951) Mathematical models in the social sciences. 
a tp Lasswell, eds. The policy sciences. 129-154. 
BERTALANFFY, L. von (1951) Theoretical models in biology and psychology. 
x lity, 20, 24-38. ; 
eee Berk H., and Trai, R. M. (1954) Some views on mathe- 
matical models and measurement theory. Psychol. Rev., 61, 132-144. 
Extson, D. G. (1949) Application of operational analysis to human motor be- 
havior. Psychol. Rev., 56, 9-17. ; ; 
Grant, D. A. (1954) The discrimination of sequences in stimulus events and 
the transmission of information. Amer. Psychologist, 9, 62-68. 
Mer, G. A. (1953) What is information measurement? Amer. Psychologist, 
Mowner, O. H. (1954) Ego psychology, cybernetics, and learning theory. In 
Kentucky Symposium. Learning theory, personality theory, and clinical re- 
search, 81-90. 


Spence, K. W. (1952) Mathematical formulations of learning phenomena. 
Psychol. Rev., 59, 152-160. 


REPRESENTATIVE PAPERS ON MATHEMATICAL MODELS FOR LEARNING TESTED 

AGAINST EMPIRICAL FINDINGS 

Bus, R. R., MosteLLER, F., and Tuompson, G. L. (1954) A formal structure 
for multiple-choice situations. In R. M. Thrall, C. H. Coombs, and R. L. 
Davis, eds. Decision processes, 99-126. 

Estes, W. K. (1954) Individual behavior in uncertain situations: 
tion in terms of statistical association theory. In R. M. Thrall, C. H. Coombs, 
and R. L. Davis, eds. Decision processes, 127-137, 

Fioop, M. M. (1954) On game-learning theory and some decision-making ex- 
periments. In R. M. Thrall, C. H. Coombs, and R. L. Davis, eds. Decision 
processes, 139-158. 

Frick, F. C., and Miter, G. A. (1951) A statistical description of operant con- 
ditioning. Amer. J. Psychol., 64, 20-36. 
Granam, C. H., and Gacné, R, M. (1940) The acquisition, extinction, and spon- 
aneo recovery of a conditioned operant response. J. exp. Psychol., 26 
251-281. i 


GuLLKseN, H., and Wotrte, D. L. (1938) A theory of learning a dt 
Psychometrika, 3, 127-149; 225-261. ja S A renate 
Hovrann, C. I., and Weiss, W. (1953) Transmission of information concernin 
concepts through positive and negative instances. J. exp. Psychol., 45 
175-182. uaa 
Mutter, G. A., and McG, W. J. (1952) A statistical descri ti f i 
learning. Psychometrika, 17, 369-396. one of verbal 
Prrrs, W. (1943) A general theory of learning and conditioning. Psych i 
er ger g &: Psychometrika, 
Restxe, F. (1955) A theory of discrimination learning. Psychol, Rev., 62, 11-19, 
Spence, K. W. (1936) The nature of discrimination learning in animals, Ps, chol 
Rev., 43, 427-449. aa 
THURSTONE, L. L. (1933) The error function in maze learning, J, 
ant 8- J. gen. Psychol., 


an interpreta- 


Chapter 12 


CURRENT DEVELOPMENTS: 
I. REINFORCEMENT AND DRIVE 


N an active field of research, with continuous interaction between ex- 
perimentation and theorizing, we do not expect the younger genera- 
tion to remain loyal to the past. In other words, we cannot expect the 
students of Thorndike to remain simply Thorndikian, those of Guthrie 
simply Guthrian, or those of Tolman merely Tolmanian. Yet we do find 
some of the preoccupations of the newer generation reflecting a back- 
ground of training under one or another of the earlier systematists. 
Contemporary developments can best be understood if they are reflected 
against a thorough understanding of the earlier systems. One role of 
history is so to familiarize us with persistent and recurrent issues that we 
gain perspective on the conflicts and quarrels of the present. Recent his- 
tory is no less important for this purpose than more remote history. Here 
lies the justification for the treatment of the separate systems in the earlier 
chapters of this book. In the process of historical development some of 
the earlier cleavages will become blurred as newer controversies take the 
center of the stage, and eventually there may be entirely new alignments. 
It is instructive to review a listing of learning theories made in 1926 

by Perrin and Klein.’ They listed (1) pleasure pain theory, (2) modified 
pleasure-pain (Thorndike), (3) confirmation-inhibition (Hobhouse), (4) 
congruity theory (Holmes), (5) drive or motor-set theory ( Woodworth, 
Tolman), (6) completeness of response (Peterson), (7) frequency-recency 
(Watson), (8) intensity (Carr ) and (9) conditioned-reflex theory 
(Smith and Guthrie). Many of the issues dealt with in these theories of 
1926 are still current, and many of the same names continue to appear in 
theoretical discussion today. But the lines of demarcation by which the 
theories could be grouped into major families had not yet became clear. 
Gestalt learning theory had not yet become influential in America, Tol- 


1 Perrin and Klein (1926), pages ae 
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man had not yet sharpened his views, and the positions of Hull and 
Skinner were still unborn. It may very well happen that in a few years 
from now the chapter divisions of this book will be as inappropriate to 
newer issues as the divisions of 1926 are today. The recent distinction be- 
tween the two families of theories, stimulus-response theories and cog- 
nitive theories, is already in the process of breaking down, partly be- 
cause of increased interest all around in problems of perception, partly 
because of the emergence of information-theory as a device for objecti- 
fying some aspects of cognitive processes, partly because of the awk- 
ward number of qualifications in respect to present theories, necessitated 
by the unceasing impact of new experimental data. 

Because of the rapidity of developments, a chapter of this kind can- 
not serve either as a thorough or an up-to-date review of the contem- 
porary literature.? All it can do is to call attention to some lines of 
development that appear promising or important, citing a few theoretical 
and experimental articles as indicative of the trends. 


THE INTERPRETATION OF REINFORCEMENT 


Within stimulus-response theories a basic iss 
pretation of reinforcement, dividin 
into contiguity-theorists and reinf 
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2 This purpose is served better by the reviews a earing in the Ps; i i 
and the Annual Review of Psychology. BP g © Psychological Bulletin 
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tiguous conditioning by arguing that some reinforcement (primary or 
secondary) is always present when learning occurs, even under the 
experimental arrangements of classical conditioning. Pavlov’s dog, after 
all, was hungry and ate after salivating. A third position, advanced by 
others (e.g., Skinner, Schlosberg, Mowrer) is that the two kinds of con- 
ditioning may require two kinds of theory, one typified by learning under 
contiguity alone, the other by learning under reinforcement. These 
several possibilities all lie within stimulus-response theory. 

Not all reinforcement-systematists have a “theory” of reinforce- 
ment. That is, one can accept the fact that certain kinds of events are 
reinforcing because they lead to the appearance of conditioned responses 
or to increase of habit strength, and let it go at that.t A learning theorist 
such as Skinner or Spence who is not interested in the physiological 
substratum of learning does not deny that learning goes on in the body. 
He may simply turn the problems of the physiology of learning over to 
someone else. So, too, the theorist interested in interrelationships among 
learning data may not concern himself about the intimate relationship 
between reinforcement and physiological needs. The commonest inter- 
pretation of reinforcement, however, can be traced back to Sherrington’ 
preparatory and consummatory response. It now goes by the name of the 
drive-reduction theory. We have, of course, met this before, particularly 
in Hull’s theory. The basic sequence is that deprivation produces a state of 
need in the organism. This need finds representation psychologically asa 
tension state known as drive, a state that energizes into action and produces 
stimuli associated with it. Thus the food-deprived animal shows the 
restless activity characteristic of the hunger drive, and the stimuli associ- 
ated with the drive lead to food-seeking. This restless state of tension 
is brought to an end when food is ingested, because food is “consumma- 
tory.” That is, food restores the bodily depletion, reduces drive, and re- 
moves the associated stimuli, bringing the cycle to an end. Primary rein- 
forcement is interpreted as related to the last portion of the deprivation- 
activity-satiation cycle. That is, a primary reinforcing state of affairs is one 
that reduces drive through satisfying need. We shall see that not all re- 


3 The issues are met so differently by cognitive theorists that it would be very diffi- 


ir views in thi on. 

e trent their views ia Mig i be used broadly or narrowly. We have favored 
the adet and most nearly neutral definition of raaioroement, pernime it to 
refer to the presentation of the stimulus that e as the uncon irion ea us 
in classical conditioning or as the rewarding stimul er — con: been. 
This broad use permits speaking of the number of rein — in efore 
a conditioned response appears in classical conditioning. N a i (1955), 
page 48, recommends a narrower meaning, for he oe an i = ent response 
before it is permissible to speak of reinforcement. J ore specialized meanings of 
reinforcement in what follows will be clear from the context. 
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inforcement theorists hold that drive reduction is necessary for learning, 
yet drive reduction remains the basic paradigm for explaining why a 
reinforcing event is reinforcing. 

In the recent literature of reinforcement theory, secondary rein- 
forcement has come to play an increasing role. Even in Hull's (1943) 
drive-reduction interpretation, secondary reinforcement did not have to 
be drive-reducing. Its effectiveness, however, was said to wear off unless 
it was sustained by association with primary reinforcement. Hence the 
mechanism of primary reinforcement remained of central interest. 

In the early 1950’s a number of investigators began to report the 
results of experiments designed to study the reinforcement process more 
directly than it had been studied before. Suspicion became aroused that 
much that had been classed as primary reinforcement was really second- 
ary. For example, it takes a long time after eating before the food pro- 
duces nutritive changes in the body that will alter materially the tissue 
state associated with food depletion. Yet the reinforcing action is more 
prompt than this, Possibly even food—the standard incentive for “pri- 
mary” reinforcement—is actually yielding relationships based on secondary 
reinforcement, or, as an alternative interpretation, possibly need reduc- 
tion is not essential even to primary reinforcement. These doubts and 


queries have led to a variety of experiments and a variety of interpre- 
tations. 


Sheffield’s Prepotent-Response Interpretation 
In order to test the need 


h that incentive was appropriate. Sheffield 
and Roby (1950) showed that 
could serve as a reward in rat learning, 
of hunger was involved. Later experi 
ing value of a sweet taste depended p 
consummatory activity (Sheffield, Rob 
Wulff, and Backer (1951) 
learn to run and leap over 
females even though no eja 


depended upon the amount 
J 
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cur. These experiments seriously question the importance for reinforce- 
ment of satisfying primary drives either through making up depletions 
(as in satisfying hunger) or through relieving excess tensions (as in 
discharges from the male sex organs). 

Sheffield, in offering the theory of prepotent responses, is very close 
to Guthrie, whose views he has often defended. He has used these experi- 
ments, in fact, in support of a contiguity interpretation of learning.® 


Neal E. Millers Theory of Stimulus-Intensity Reduction 


Because, for Miller, any strong stimulus can become a drive and im- 
pel to action, his drive-reduction theory of reinforcement is really a 
stimulus-intensity-reduction theory. 

We have learned that it is too much to expect to find a crucial 
experiment that will resolve our disputes once and for all. Miller’s reply 
to Sheffield’s experiments is contained in the results of a series of experi- 
ments in which the consequences of food in the mouth can be compared 
with food in the stomach as reinforcing agents. If food injected into the 
stomach is reinforcing, this is obviously a threat to Sheffield’s prepotent 
response theory. For example, using the stomach-fistula method, Miller 
and Kessen (1952) taught rats a simple T-maze when milk followed 
choice of the correct side and normal saline solution followed choice 
of the incorrect side. When the incentive substances were received by 
mouth the animals learned faster, but they also learned when the rein- 
forcement was received by stomach (Figure 60).7 The authors believe 
that their results support the drive-reduction theory and refute Sheffield’s 
prepotent-response hypothesis. That learning was faster by mouth would 
be expected from the support given to the learning via secondary rein- 
forcement. ; 

In developing his form of drive-reduction theory, Miller has strug- 
gled particularly with the problems created by learned drive (fear) based 
upon pain. The empirical situation is well established. It is possible to 
shock a rat in the white compartment of a two-compartment box, and 
arouse fear. Placed again in the box, without shock, he shows all the 
signs of a “conditioned” fear. Given an opportunity to jump a hurdle, 
turn a wheel, or in some other manner find his way out of the compart- 
ment to a “safe” one in which he has not been shocked, he “learns” this 


5 Sheffield (1951). i 50), Miller (1951 
: i 41), Dollard and Miller (1950), Miller ( 95la) (1951b). 
eee yes nee to reduce subsequent food consumption when taken 
by Saer or a by stomach fistula, by Miller, Murray, and Roberts (1955), 
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ss f escape. The drive reduction that serves as the reinforcement is 
m ae f fear.* To hold consistently to a drive-reduction theory of 
ae oman is mecussusy to account for the rat’s fear of the compart- 
re he was shocked. It does not do to fall back upon classical 
Sais en to account for the rat’s agitation as a substitution of the 
co 
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y mouth immediately and 
n (1952), page 560, 


previously neutral compartment for the shock that cause 
in the box-shock trials. Miller recognizes that in order t 
he must assume that the agitated responses are reinfor 
responses are, namely, by stimulus reduction, hence in this case by 


termination of the painful stimulus. He adopts this interpretation of what 
happens, though he confesses some uneasiness about it.? 


Because strong stimulation is not necessaril 
state, Miller’s theory has a genuine advantage over Hull’s in that it 


yields a unified interpretation of primary and learned drives and of pri- 
mary and secondary reinforcement. In his words: 


d the agitation 
o be consistent 
ced as all other 


y based upon a need 


Thus the basic mechanism of motivation ( 
for primary and learned drives, and the basic 
reduction in strong stimulation) 


strong stimulation) 
mechanism of reinfo 
is the same for primary and learne 
8 Miller (1948a). See also Brown and Jacobs (1949). 

® Miller (195la), page 875. 

10 Miller (1951b), page 440. The attempt to 


provide a demonstration of a learned 
drive based on hunger has not proved successful, Myers and Miller (1954), 


is the same 
reement (a 
d rewards,19 
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Mowrer’s Two-Factor Theory 

Mowrer early saw the possibility that much learning could be ac- 
counted for on the basis of reinforcement motivated by acquired fear, 
or by anxiety, as he then called it. His was the first clear statement of 
the anxiety-reduction or fear-reduction theory of reinforcement.'! Re- 
flecting upon the same problem that bothered Miller, the problem of how 
fear is acquired via pain, Mowrer later came to reject the drive-reduc- 
tion theory, and returned to the familiar stimulus-substitution theory of 
classical conditioning. But he saved the drive-reduction theory for instru- 
mental learning, and thus came to espouse a two-factor theory.** 

The interpretation of learning as a dual process was not new. We 
met it in Thorndike’s associative shifting and learning under the law of 
effect. We met it also in Skinner's two types, Type S and Type R. A some- 
what similar proposal had been made by Schlosberg (1937). Mowrer’s 
espousal of the dual theory was important, however, because it repre- 
sented a conversion to the position by one who had been rather closely 
identified with the drive-reduction unified theory. He thus set himself 
in direct opposition to the views of Miller, although both men were pro- 
ducing related experimental results. 

Mowrer, in elaborating the two-factor theory, has used the term 


sign learning for simple conditioning of the stimulus-substitution or as- 
sociative-shifting variety, and solution learning for trial-and-error learn- 
ing under reinforcement. He writes: 


In order to make a two-factor theory work, “conditioning,” it now appears, 
must be restricted to the process whereby emotions, meanings, attitudes, ap- 
petites, and cognitions are acquired, with “solution learning applying to all cases 


of overt instrumental habit formation.** 

Mowrer believed that he found support for his theory in the differ- 
unction between the autonomic and the central nervous system. 
of which fear is an example, are largely involuntary 
and are chiefly under the control of the autonomic system, Problem-solv- 
ing responses, instrumental responses, are largely voluntary, skeletal, 
under the control of the central nervous system. Hence it would not be 
surprising if in the course of evolution different principles of learning 


ences in f 
Emotional responses, 


i 7 1950). He a 
ae ne oe were together at Yale they collaborated on a demonstration 
aratus to exhibit the learned 
and Miller (1942). 7 
12 Mowrer (1947) (1950) (1951) ( 1954b). 
13 Mowrer (1951), page 352. 
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om have developed, their dissimilarities showing in the differing 
m 
ah 3 7 =- ees Mowrer’s interpretation and Miller's. 

oer ee ; A unified theory, runs somewhat up against the 
le. a experience. That is, we seem ordinarily to be ap- 
as = 5 ied exceed onset of pain, and it is hard to believe 
a z n strengthened because pain subsides quickly. In order 
"ee ets A that conditioning of fear depends upon the termina- 
ag = a A and Suter ™ performed an experiment in which a 
Wee 2 on pet as the conditioned stimulus. The subject could 
ee toi tr running to the opposite end of the apparatus during 
tee as of the persisting conditioned stimulus, If however, the 
alain (as it did at the end of 5 seconds) there was no way 
= E ee it continued for 10 seconds, The two conditions of the 
eae were (1) terminating the conditioned stimulus at the onset 
ex 


of the shock, and (2) terminating the conditioned stimulus at the end of 

k. Mowrer argued that Miller’s theory should produce greater 
y = ing under condition (2), since the end of shock is reinforcing; 
etn ee would predict no difference in the two methods, since 
eon with Sat is common to both, and is all that matters. The 
Taia confirmed Mowrer’s deductions from his theory, 


in that there was 
igni i he two groups. 

significant difference between t r 
i Miller (195la) rejected the interpretation of the Mow 
experiment, basing his case in part on criticisms by Kendl 
Sheffield (1951), and on some results of Sullivan (1950) 
support his own theory. We must leave this issue as unre: 
Mowrer’s theory is currently under: 


rer and Suter 
er (1951) and 
that appear to 
solved, 


going revision, but the new theory 
is too recent for critical exposition here. The two factors of the original 


tsion have given way to two new factors: incremental fear conditionin 
ie tgs . 
(secondary motivation), and decremental fear conditioning (secondary 
reinforcement). In the new theory, all learning is sign learning.1* 


Spence’s Interpretation of the Role of Reinforcement 
In his recent formulation of his position in the Silli: 
Yale, Spence (1955) has reiterated his lack of active in 
M ceneni works, for example, whether or not it is 


man Lectures at 
terest in how re- 
drive-reducing or 


i r (1950), pages 278-287. 
h eo ene, endene on Mowrer’s theory, see Mowrer and § 
d aes and Aiken (1954). On Miller's side, see Zeaman, D 
an 
(1954). 


18 Mowrer (1955). 


olomon (1954) 
eane, and Wegner 
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need-reducing. He has preferred all along to adopt the empirical law 
of effect (in this agreeing with Skinner) and then to deal with quanti- 
tative relationships, using (unlike Skinner) an approach via intervening 
variables. 

Because Spence’s views are commonly held to be close to those of 
Hull, he is careful to point out what differences there are between his 
general theoretical preferences and those of Hull. He makes four points: 

1. In the first place, the formal hypothetico-deductive style, as used 
by Hull and others (1940), did not appeal to him as appropriate at this 
stage of learning theory. Along with Bergmann * he showed that Hull’s 
theoretical successes lay rather in the intervening-variable approach, and 
it was the article in which they pointed this out (and the extensive cor- 
respondence then being carried on between Hull and Spence) that led 
Hull in his 1943 book to characterize his method in terms of intervening 
variables and to borrow this term from Tolman. 

2. Spence felt that Hull went farther than necessary in attempting 
to give neurophysiological specification to his intervening variables. 
Although Spence prefers to remain neutral with respect to physiological 
interpretations, he would have no objection, of course, to genuinely fruit- 
ful physiological concepts. 

3. Whereas Hull adopted reduction in drive strength (or reduction 
as essential to reinforcement, Spence did not commit 


in drive stimuli) 
s, holding instead merely to the empirical law of 


himself on these issue 


effect. 
4, Finally, Spence prefers to develop his position step-by-step, with 


eorizing always very close to data. Hull was rather 
ke guesses in the form of “postulates” on the 
vidence. Spence’s objectives are stated thus: 


his ventures into th 
more willing than he to ma 
basis of minimum empirical e 


That is, the initial objective is to develop a set or system of concepts and 
ostulated interrelationships which, in combination with the different initial and 
Ponda conditions specific to each experimental situation, will provide for the 
deduction of the several kinds of conditioning laws that have been obtained. It 
is hoped further that this theory, along with the necessary additional composition 
rules, will permit one to account for more complex learning phenomena.** 


Spence indicates two reasons why, over the years, he was not actively 
interested in the mechanism of reinforcement. The first was his interest in 


quantification, which he repeatedly pointed out did not require a theory 
of reinforcement.” The second was the difficulty that showed up in 


e (1941 


b 
17 Bergmann and Spence i from manuscript (Lecture 8). 


18 Spence (1955), quo! 
19 E.g., Spence (1952b). 
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experiments on latent learning of ever finding an experimental situation 
totally free of reinforcing events. It was the effort to explain the various 
things happening in latent-learning experiments that led to an interpre" 
tation of reinforcement important in Spence’s contemporary theorizing.? 

The contemporary theory accepts the association of stimuli and re- 
sponses through their contiguous occurrence. Events at the goal are 
important. The conditioning that takes place in the goal-box is seinforced 
(as Pavlov’s dog's salivation is reinforced) and no special theory of this 
reinforcement is presented. That is, for conditioning to occur, the sub- 
ject must respond to the goal-object either b 
perceiving it. This reinforcing act 
tional goal responses (re) 


ry. The instrumental responses 


themselves, in this chain, are strengthened merely because they occur, 


and not because they are reinforced through some act at the end of the 
chain. The reinforcing act at the end of the chain strengthens the r¢’s, 
and these affect earlier links of the chain as motivators rather than as 
contributors to habit strength. Thus the re-se mechanism becomes iden- 
tified in Spence’s theory with Hull’s incentive motivation (K) rather 
than with habit strength (H). Differences in reaction potential of re- 


sponses at different distances from the reinforcement end of the behavior 


chain are thus due to differences in the K factor rather than to differences 
in H. 


Because reinforcement may be necessary for classical conditionin 
and is not decisive for instrumental learning, Spence toys with the no- 
tion (without committing himself to it) that he may have a two-factor 
theory just the opposite of Mowrer’s: 


It is interesting to note that if one were to adopt a theory that reinforcement 
(e.g., need reduction) plays a decisive role in the acquisition of habit strength of 
classical conditioned responses but not in the case of instrumental conditioned 
responses, one would have a two-factor theory that is 


exactly the opposite of th 
well known two-factor theory espoused by Schlosberg, Mowrer ane others,21 $ 
Because Spence’s theo 


ry is becoming a major comp 
theories, and is already le 


etitor to other 
ading to relevant experimental 


work, it is de- 


e years, and h 
g- Kendler, 1946). 


as been mentioned in 
ann, and Lippitt (1950). See 


It was first elaborated 
also Spence (1954a) 


21 Spence (1955), quoted from manuscript (Lecture 5), 
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sirable to see an example of how it works out in quantitative studies. 
The theory does not, in fact, commit Spence to an interpretation of the 
mechanism of reinforcement for classical conditioning. It does, however. 
require that he demonstrate that the K factor (based on anticipatory goal 
responses) acts as drive does upon excitatory potential rather than as 
habit strength does. 

If K acts as a drive, it will have a multiplicative relationship to H 
in the determination of E. That is, other things being equal, 


E=HxkK (1) 


where E is reaction potential 
H is habit strength 


and K is the incentive motivator based on the fractional goal response 
Equation (1) above is written in terms of intervening variables. 
What are the experimental variables correlated with these intervening 
5 e . : ios 
variables? The diagram in Figure 61 shows how experimental variables 
and resulting behavior are related to the intervening variables. 


Experimental 
variables 


Intervening 
variables 


Response R 
variables 
experimental variables to intervening variables in the 


FIGURE 61. Relation of ; 
determination of K as a multiplier of habit strength. Spence (1955), Lecture 5. 


s assumed to be related only to N, the number of 
whether goal responses or not. Hence we are in- 
he determiners of K, if we are to understand E. 


Habit strength i 
response occurrences, 
terested primarily in t 
These are as follows: 

Ng = the number of classical conditioning trials, i.e., the number 
of times the subject enters the goal-box and responds to 


s or sees) the goal-object. 


(consume: 
f goal substance consumed (number of pellets, 


Wea = amount 0 
weight or volume of food). 
as yet unknown variables that presumably de- 


vigor of the consummatory response, e.g., the 
sweetness of the object, the amount of sucking effort required 
to obtain the object in the case of liquids, possibly the hard- 
ness of the goal-object, and so on. 


X,...Xy=a number of 
termine the 
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: The presumption is that experimental Waxinticins of reinforcing agents 
can be shown, by way of properly devised experimental tests, to affect 
the nature of K. If the habit strength of re is increased, K should be in- 
creased. The relevant K may also be affected by the vigor of the par- 
ticular rę that is strengthened, that is, the particular component that 
gets conditioned from out of all the activity going on in the goal box. 
The theory implies that the differences in reaction potential of re- 
sponses at different distances from reinforcement will be due to differ- 
ences in K rather than in H, and this theory leads to a number of de- 


ductions. In order to see how these deductions are submitted to test, we 


E=H(D +K) (2) 
Now we turn to an experiment. Suppose we are interested in the 
running speed of a rat in a l-foot section of a straight runway, the 
section beginning 1 foot from rt, and 4 feet from the goal. This 
running speed is either reinforced as a habit by what happens in the goal 


illustrated here, for the quantitative deduc- 
tions of the two theories differ. 


Expressed in reinforcement term 


s (Hull’s 1943 theory), the excita- 
tory potentials for shorter and longe 


r distances from the goal will be: 


Es = Hs x D (3) 
EL = H, xD (4) 
Subtracting the two equations, we find: 


Es — E, =D x (Hy — H,) (5) 


> at 


Equation (5) shows that the differences in excitatory potential 
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a given level of practice, will be a function of drive. Because running 
speed is assumed to be a linear function of excitatory potential, this 
could be tested experimentally. If (in appropriate units) drive level 
doubles, running speed should also double. This would be a simple 
Hullian inference. 

If we go through similar steps in accordance with Spence’s theory 
we come out with a different inference. According to Spence: Í 


Es =H x (D + Ks) (6) 
EL =H x (D + Kz) (7) 

Subtracting these two equations, we have: 
Es — Es = H X (Ks — Kz) (8) 


Equation (8), by contrast with Equation (5), shows that at differ- 
ent levels of practice (represented by H) the differences in reaction 
potential between the shorter and longer path will be independent of 
drive level, and a function solely of differences in K. 

The experimental test to determine whether Equation (5) or Equa- 
tion (8) holds is straightforward. Run the rats with two delays under 
two drive levels, holding everything else constant, and see whether the 
differences in running speed under the two delays diverge or remain 


` constant with changes in drive. If they diverge, the change with drive is 


more likely to be due to H, as implied in Hull’s theory and Equation 
(5). If the differences remain constant, support is given to Equation 
(8) and to the interpretation that running speed depends on K rather 


than H. A preliminary test run in Spence’s laboratory yielded the results 


shown in Figure 62, confirming his views, as implied in Equation (8). 
The figure shows the results for 9 rats per group on the first daily trial 
over the last three training days (days 10, 11, and 12) for the four 


groups of subjects. ; 
We shall meet some other derivations from this theory in the next 


chapter. For the present, this suffices to indicate how the theory is 


translated into experimentable form. 
How can Spence’s theory best be characterized, in relation to other 


reinforcement theories? Let us quote his own words: 

This theory is, then, a reinforcement theory so far as excitatory potential is 
concerned; that is ‘the presence or absence of a reinforcer, and differences in its 
properties ‘when present, do make a difference in the strength of the instrumental 
response. It is not, however, 2 reinforcement theory in the traditional sense of the 
term, for the habit or associative factor is not assumed to vary with variations in 

š a 

reinforcement.?? 


22 Spence (1955), quoted from manuscript (Lecture 5). 
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FIGURE 62. Test to determine whether or not drive level affects amou 
crimination between two delays, Bec: 
constant, Spence’s theory that D a 
Lecture 5. 


S nt of dis- 
ause the differences at low and high drive remain 


nd K are additive is supported. Spence (1955), 


Sewards Two-Moment Theory of Motivated Learning 
In a series of theoretical papers be 
tempted to unravel the problems of dri 
ing of instrumental behavior under both appe 
The early statement of his the 


was upon the fractional go 
tation rather than association. This is 
with: the incentive motivation is a co; 


it. Seward 
took as his point of departure Hull’s (1950a) revi 
Spence’s (1947) interpretation of delayed. 
like Spence, was worried about latent learning, 
similar theories. 
Seward’s general position is as follows: 


23 The chief papers in the series are Seward (1950) (1951) (1952) (1953), 
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1. All learning is based on a simple law of association, without 
reference to reinforcement. 

2. Conditioning of the fractional goal response is a special case of 
such learning, limited by the amount and delay of reward. 

8. This conditioning of the fractional goal response results in a con- 
struct (at first called tertiary motivation, but the term was later dropped ) 
which enters into the equation for sEr as a multiplier along with D and 
sHr. 

Because this multiplier facilitates the act in progress, instrumental 
activity is affected by what happens at the goal. 

The modifications of Hull’s position by Spence and by Seward are 
readily seen to be very similar. Drive-reduction reinforcement of instru- 
mental responses is dropped by both, and fractional goal responses enter 
as components of motivation. Beyond that the theories differ. 

Seward (1952) ran into trouble in applying his theory symmetri- 
cally to learning based on either appetitive or aversive drives.** He intro- 
duced a modification in the form of a two-moment theory of motivation. 
As he reviewed the events in a cycle of motivated behavior, it appeared 
to him that most activity begins with something aversive and ends with 
something satisfying. This is, of course, coherent with a common inter- 
pretation of the need-drive-incentive pattern, for drive is conceived as a 
state of disturbed homeostasis, while the incentive restores the equilib- 
rium. The problem reduces, then, to two subproblems: how drives are 
acquired and how goals are acquired, with the implications for appe- 
titive and aversive learning. Through ordinary conditioning processes, a 
drive surrogate (rsp) is acquired as well as a goal surrogate (rsa). These 
are the two moments of motivation which are now said to contribute to 
stimulus-intensity dynamism of a motive (Vm), as follows: 

Vin = @(18q + Tsp) +k 


This V,, is a multiplier of habit strength, as in Hull’s equations.” 

The advantage of the two-moment theory is that relative weights can 
be assigned to rsa and to rsp to account for appetitive and aversive prob- 
lems. In problems such as escape from pain and pain avoidance, empha- 


sis is placed upon rsp, with rsq playing a minor role. The interpretation of 


24 Spence has not yet dealt specifically with escape and avoidance learning in the 


is theory. reat 

i 1 pinent DE os whether produced by external events, by the organism’s 
i i by drive, must have both cue aspects and stimulus-intensity dynamism 
Peete. ie A at deal of arbitrariness enters into terms such as D, K, and V in 


Hais system. ‘or any derivatives from it. Seward could have chosen to tie his terms to 
K, as Spence does, instead of to V. 
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experiments such as those of Miller (1948a) and Brown and Jacobs 
(1949) rests almost entirely on acquired fear, a form of rsp. But in the 
case of instrumental learning for reward, as in food-rewarded hunger, 
there has been little success in finding learned drives (fso), so that 
most of the burden is taken on by learning related to the goal response 
(rse). The separation of these two moments of motivation is a very 
natural development from familiar interpretations of drive and goal, and 
it appears to open up a useful line of investi 


gation in harmonizing learn- 
ing under reward and punishment. 


Concluding Remarks on Reinforcement 


We come out with 


post-Hullian interpretations of reinforcement as 
follows: 


l. Single-factor theories 


a. Contiguity only (Sheffield, Seward) 
b. Drive reduction only (N. E. Miller) 
2. Dual theories 


a. Drive reduction for instrumental acts only ( 


Mowrer) 
b. Drive reduction for 


classical conditioning only (Spence) (?) 


It is evident that agreement has not yet been achieved! 


NEEDS, DRIVES, AND INCENTIVES 


We have seen that the problem 
the reinterpretation of reinforcemen 
tion of motivation. 


s that have arisen in connection with 
t are also problems in the interpreta- 


The need-drive-incentive pattern, in its sim 


plest form, asserts that 
physiological needs are created b 


(or by tissue injury, as 
hese needs give rise to drives, A drive, 
> until an incentive 
reduces the 
deprivation 
hold to this 
cle whether 
this pattern 
Cent studies 
e some who 
need-drive- 


drive by satisfying the need, that 
or by getting rid of the noxious sti 
general pattern of events as a mo 
or not one accepts a reinforcemen 
serves as a crude approximation ti 
show that the details are very complex indeed, and there ar, 
have come to believe that, as a motivational model, the 
incentive pattern is actually misleading. 


is, by making up for the 
mulation. It is possible to 
del of the motivational cy 
t theory of learning. While 
o what actually happens, re 


26 Seward (1951), Myers and Miller (1954). 
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Drives as Correlates of Physiological Needs 


We may define “need” as the physiological state of deprivation or of 
tissue injury, and “drive” as the psychological consequence of this state. 
We know, in general, that the longer we have abstained from food, the 
hungrier we get, and that rats deprived of food tend to run faster in an 
activity cage than well-fed rats. As we examine the details of the need- 
drive correlation, however, we find that we cannot assume a one-to-one 
correspondence between need level and drive level. 

We have already seen how Hull (1952a) introduced an inanition 
factor to account for the fact that as animals become weakened by de- 
privation their food-seeking drives are also weakened. Prolonged starva- 
tion is thus one exceptional condition that disturbs the relationship be- 
tween need and drive. 

Other qualifications of the need-drive correlation are shown by the 
experiments on artificial drive reduction, where need is not satisfied (as 
in saccharin feeding), or where a common amount of need reduction 


leads to varying amounts of drive reduction (as in the fistula-feeding 


experiments ).*7 
There must be some sort of “ feedback” mechanism within the nervous 


system that tells the organism when it has had enough to eat or to drink, 
prior to the metabolic assimilation of the food or liquid. The search for 
such mechanisms helps to throw light upon the relationship between 
need and drive. We have met the suggestion by Heron (1949) that dis- 
tention of the stomach, whether by water or by food, seemed to be one 
basis for drive reduction (somewhat independent of need reduction). 
If Heron’s theory were true, it would not deny the usual correlation be- 
tween need and drive, for usually the stomach distention would be 
produced by substances appropriate to the satisfaction of need, But 
doubts concerning the generality of Heron’s results have been raised by 
Miller and Kessen (1954), who inserted a small balloon on the end of 
the stomach fistula of rats prepared as in their previous experiments. 
They could now compare the behavior of the rats when the stomach was 
distended by milk with their behavior when the stomach was distended 
by the inflated balloon. They found in experiments with a T-maze that 
the results did not support Heron’s claims. The milk served as a reward, 


27 In addition to the experime 
may note the related experimen 
Kohn (1951), Berkun, Kessen a 

eated for the thirst drive by Samp 
lished). 


nts by Miller and Kessen (1952), already cited, we 
ts by Hull, Livingston, Rouse, and Barker (1951), 
nd Miller 1952). The experiments have been re- 
liner and Woodrow, in Miller’s laboratory (unpub- 
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and led to the choice of that arm of the T that was followed by milk 
injection; the inflated balloon served as a punishment, and led to avoid- 
ance of the side that led to balloon inflation. Hence stomach distention 
alone is not the condition resulting in drive reduction of ei 
or thirst. 

Another feedback mechanism has been located in the region of the 
hypothalamus, clearly affecting the thirst drive. Andersson (1953) 
that the injection of a saline solution into the third ventricle of the brain 
(the region of the hypothalamus) produced excessive drinking in goats. 
The experiment has been repeated and extended in Miller’s ] 
with cats as subjects.?8 Hypertonic saline solution (2 
jected into the ventricle produced excessive water consumption, while 
pure water produced a minimum of drinking. Isotonic salt solution fell 
between the two. The amounts injected were very small (0.15 ce. of 

a sort of “detector” mechanism in the 


nervous system that may prove to give important clues as to how the 
organism adjusts its liquid input to its needs. 


ther hunger 
showed 


aboratory 
per cent NaCl) in- 


Drives as Goads to Activity 


Tt has usually been assumed that restless a 
heightened drive, and the activity-cage p 
used as an indica 


w of the well-estab- 
tive food preferences, 
ing of thresholds for 
sence of food would 
kind of stimulation, 


lished findings that bodily deficiencie. 
a result that might come about t 
specific kinds of stimulation. Total prolonged ab 
correspondingly lower thresholds for almost any 
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FIGURE 63. Effects of deprivation on “spontaneous” activity as compared with 
activity when an environmental change is introduced. Campbell and Sheffield (1953), 
page 321. 


Drives as Cues to Differential Response 

In addition to whatever drives do to reduce thresholds or to stimu- 
late activity, they also help to direct activity through providing various 
differential cues. For example, an animal can be taught to turn one way 
when hungry, another way when thirsty. Bailey and Porter (1955) have 
taught cats to perform one instrumental response to get water when 
thirsty, and another to get food when hungry. Eight of nine cats were 
highly successful in making the discrimination. 

Bailey (1954) performed an experiment with rats to determine 
whether or not hunger or thirst could provide differential cues for a 
discrimination rewarded not by food or water but by escape from intense 
light. The rats had to press one of two panels in order to turn off the 
light. Which panel was correct was determined by which drive was 
active. Thus the drives served as cues exactly as if they were different 
figures on the panels. Three groups were able to discriminate (a) thirst 
from hunger, (b) thirst from satiation, and (c) hunger from satiation. 

In the same series of experiments, Bailey also used the presence or 
absence of a 10,000 cycle tone as a cue for the differential response to the 
panels. If the tone was presented shortly before the time of response, re- 
sults were very similar to those with drive discrimination. If the tone 
began to stimulate the rat a long time before he was placed in the 
apparatus, he did not learn. The difference between the long-delay and 
short-delay tone is used to interpret the difficulty sometimes found in 
drive-discrimination experiments, because drives are long-persistent 
stimuli. The argument that long-persistent stimuli are hard to discrimi- 
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= is weakened by the finding that drive discriminations were no more 
difficult than short-delay tone discriminations. Supplementary conjec- 


tures are required, possibly about cue distinctiveness, if the original argu- 
ment is to hold. 


Drive Interaction and General Drive Level 


Though drives provide differential cues, all drives may have some- 
thing in common. Each may contribute its share to a general drive level, 
as implied in the theories of Hull and of Spence. This general drive level 
(D) is the multiplier used in their equations to produce reaction poten- 
tial from habit strength. The theoretical problem of a general drive level 
contributed to by specific drives is subject to a variety of experimental 
approaches. 

Verplanck and Hayes (1953) conducted a series of experiments on 
food and water deprivation in rats in which the animals later, under 
conditions varying from experiment to experiment, had an Opportunity to 
eat and drink. They found a very complex picture of drive interaction 
as measured by consummatory activity. For example, the amount of food 


eaten by water-deprived rats is some 60 per cent of the amount eaten 
by those which have had continuous access to water, 


of water drunk by food-deprived rats is onl 
animals which 


while the amount 


y 40 per cent of that drunk by 
have had continuous access to food. If the animal is 
allowed to drink, after being deprived of water for 


22 hours, so that its 
water needs are satisfied, it is motivated to eat about as much food as if 
it had been deprived of food for the same period. S 


imilarly, termination 
of a fast, without water deprivation, leads to water consumption ap- 


proaching that of water drunk after a like period of water deprivation. 
The authors believe these results to be significant for studies of latent 
learning, which often try to distinguish too sharply between “hungry” 
and “thirsty” rats. = 

Considerable success has been achieved in studies originating at 
the University of Iowa, and largely carried out there, which treat some 
generalized human motives according to the intervening variable (D) 
and then make predictions as to consequences for conditionin diss 
crimination, and verbal learning. Most of the st B 


and \ udies have used a meas- 
ure of “anxiety,” based on Taylor’s anxiety scale.?° Others have used a 


of achievement motive in 
measure somewhat the same way. Farber 


29 A number of these studies have been listed in Footnote 32, Ch; 
summaries, in relation to drive theory, can be found i ost 


n Fark napter 9. Theoretical 
30 E.g., Hurley (1953). Cited by Farber (1953), er (1954), Spence (1955). 
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(1953) points out that possibly achievement scores should not be used 
as indicative of drive level, but rather as indicative of a general response 
tendency influencing performance in a variety of situations. He suggests 
that it might possibly be referred to as an “achievement-oriented habit.” 

In his very thoughtful review of anxiety as a drive state, Farber 
(1954) rightly points out the successes that have been achieved through 
treating anxiety parsimoniously as generalized drive (D). He notes, 
however, that anxiety must be a special kind of D; it could not be 
claimed that just any heightened drive would lead to precisely the same 
consequences. His doubts about the status of achievement motive are 
perhaps equally applicable to anxiety. The detailed studies which show 
how test-measured anxiety acts as a cue or specific set of response ten- 
dencies (in addition to its enhancement effect as generalized drive) have 
not yet been reported.™ 

In developing the relationship between reaction potential, drive, and 
habit formation, the later theorizing of both Hull and Spence limited 
the effect of drive (D) to an influence upon reaction potential, not 
upon habit strength. An experiment by Campbell and Kraeling (1954) 
calls this into question. Rats trained to run in a straight alley under 
high and low strengths of hunger drive were tested in extinction trials 
according to two measures: resistance to extinction (measured by trials) 
and amplitude (measured by running speed). The Hull-Spence posi- 
tion was confirmed when resistance to extinction was used as a criterion, 
but with amplitude there was greater response strength during extinc- 
tion with stronger training drive level, regardless of the drive level dur- 
ing extinction. This experiment is merely one mog showing how un- 
certain are our generalizations based on the drive-reinforcement para- 


digm.?? 


The Positive Role of Incentives 
The words “goal,” “Jure,” “incentive” seem to imply some kind of 
directional orjentation toward a state or condition that is attractive. We 


: ` Js, By a curious chain of circum- 
say that organisms strive toward goals. By s 


ome experiments to determine whether anxiety of this 


g i yhether it acts more as if it merel; 
i en neralized excitement or whet’ c A ly 
pa ma ie De shold foe emotional experiences. Experiments in Eysenck’s laboratory 
pee z iet Sak selective effects for different personality types—contrasting a 
ce ape rainy P hysterical” type. s Pee 
eee type e goer out that their experimental result also criticizes 
H ee ae rE ction and amplitude of response as alternative meas- 
Fie ae F E oi for the functional relationship of experimental variables to 
reaction > ; ; i i 
response differ depending upon which measure is use 


31 Spence (1955) reports S 
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stances, this rather primitive nature of goal-experiences was largely 
lost sight of in the theorizing of the last decade or two. To be sure, it 
could not be ignored entirely, but striving and goal-seeking had to be 
derived from more primitive principles before the familiar facts could 
be recognized as valid. The circumstances that led away from attribu- 
ting positive qualities to incentives probably arose historically through 
the turn toward objectivity and away from conceptions that appeared 
tainted with animism or teleology. 

There is little doubt that the drive-reduction theory of motivation 
led to a predominantly negative role for incentives. According to this 
interpretation, food is eaten not because it tastes good but because it 
ends hunger-pangs. A compartment in which the rat is not shocked is 
reinforcing not because of anything attractive about its size, color, 
temperature, or fittings, but because it provides relief from a compart- 
ment in which shock has been given or is threatened. This negative 
position moved into the interpretation of human motivation as well. 
“At the level of ego-psychology,” writes Mowrer (1952) 
said to be only one master motive: anxiety” 
positive, like love, or avarice, or ambition, to 
from which we seek to escape, 


A rather one-sided position of this kind often gains wide currency 
because of controversies that develop amo 


ng those who hold the position, 
so that this point of agreement is not questioned. For example, this issue 
does not arise between Guthrie’s contigui 


ty theory and Hull’s drive-reduc- 
tion theory. Guthrie, too, was negative about the role of rewards, since 
all they did was change the situation, Psychoanalysis, which might be 


thought of as a somewhat competing position, tended also to describe 
adjustment negatively, adjustment depending upon anxiety reduction or 
escape from feelings of guilt. Even Lewin’s theory, which recognized the 
positive valences of some incentives, looked to tension reduction as the 
basis for motivated action. Support was given also through Cannon’s 
doctrine of homeostasis, which made it easy to view the function of 
motivation as the restoration of equilibrium through reduced tension. If 
all these factions accepted some form of tension redu 
reduction or mere change toward equilibrium as the b 
ing motivated action, it is not surprising that the r 
mained essentially negative, Concessions are alw. 
that it can be pointed out that there were so 


, “there may be 
(p. 423). Not something 
Spur one on—just anxiety 


ction or stimulus 
asis for understand- 
ole of incentives re- 
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for what was inherent in the incentive, but something providing relief. 
The incentive was seen as an avenue of escape from pain, anxiety, tension. 

The drive-reduction theory or tension-reduction theory, predomi- 
nant in mid-century, has been coming under increasing attack and re- 
examination by its friends as well as by its enemies. 

1. There have, of course, been critics of the drive-reduction theory 
all along. Because of its inviting nature and its acceptance by otherwise 
conflicting factions among the learning theorists, most of those who ob- 
jected were “voices crying in the wilderness.” They failed to get much of 
a hearing. McDougall (1928) argued that striving toward a goal was 
satisfying, and reaching it was relatively unimportant; Allport (1937) 
(1946) never did like the law of effect or the motivational theory related 
to it; Murphy (1947) wanted sensory and activity drives added to vis- 
ceral ones; Anderson (1941) wrote on the externalization of drive; Young 
(1949) felt that we paid too little attention to appetites in our over- 
attention to drives. These views did not do very much to stem the tide 


of the prevailing theory, though they may have sown dormant seeds of 


doubt that have more recently sprouted. 
2. A rather important feature of the background for the present ques- 


reduction theory has come from within the camp 
The entering wedge came through experiments 
on secondary reinforcement, as when Wolfe’s (1936) chimpanzees ac- 
cepted poker-chips as rewards instead of food. When Hull in 1943 did 
not require drive reduction in order for a secondary reinforcing agent 
to strengthen habit, he opened a way for recognizing the positive roles 
of incentives, though he was reluctant to follow this path. Instead, by 

ary reinforcers also stimulus-intensity re- 


1952 he had made second 
ducers. He later introduced (K), incentive motivation, as a further move 


in the same direction, although K applied to primary incentives only. 
Spence’s incentive motivation, related to Hull's K, while supposedly 
separable from the general drive level (D), is additive to D, and hence 
functions exactly as a component of drive. Spence’s theory gives recog- 
nition to a somewhat neglected role of incentives: their drive-enhancing 
role as well as their drive-reducing role. At least for perceived incentives, 
naturalistic observation tells us that they can enhance drive, as when a 
male dog is aroused by a receptive female in the neighborhood. Pre- 
viously experienced incentives, as in Spence’s theory, act similarly through 
the arousal of fractional anticipatory responses, even though the in- 
centive is not directly perceived. The incentive, in its role of enhancing 
drive, takes on qualities not suggested in its role as a drive reducer. 


tioning of the drive- 
of reinforcement theorists. 
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The drive-reduction theory is naturally strained if the same object 
can both enhance drive and reduce it. The paradox is partially resolved, 
however, if we recognize that more than one response is possible to the 
same object. Thus the smell of fresh bread ma 
eating the bread may reduce it. In the case 
problem is more difficult, for they are obvio 
their drive-enhancing and their drive-reducing qualities are harder to 
distinguish. Miller (1951a), struggling with this problem, suggested that 
perhaps the two effects alternated in some manner. Mowrer (1947), fac- 
ing the same problem, found it necessary to give up the drive-reduction 
interpretation as a unitary one, and go to his two-factor theory. 


We have already seen how the experimental evidence makes con- 
fident assertions about the need-drive 


than a few years ago. With this weakeni 


y increase appetite, while 
of secondary rewards the 
usly not need-reducing, so 


1c system. It can be aroused (1) 
d the act of eating, (2 
conceptually. Because these sev, 
may summate, hunger can be arouse 


) by stomach 
eral determiners 
attractive ap- 
emphasizes the 
ducing role, By 
the precise physi- 
d perceived incen- 


central motive state” (Morgan, 1943), 


ess significant an 


in their cages they learn to 


ugh no “bait” (no obvious reinforcement) is 
used. They show a great deal of curiosity and will work hard merely 
for an opportunity to “look,” 33 

Reflections upon these experiments have led Harlo 


; w to make strong 
uction and Physiological s 


a tates as the basis 
for motivation: 

There are logical reasons why a drive 
which emphasizes the role of internal, 
untenable as a motivational theory of | 


-reduction theor 
physiological-state 
earning. 

23 Butler (1953); Harlow and McClearn (1954), 
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The condition of strong drive is inimical to all but very limited aspects of 
learning—the learning the ways to reduce tension. . . . The hungry child is a 
most uncurious child, but after he has eaten and become thoroughly sated, his 
curiosity and all the learned responses associated with his curiosity take place. 


He interprets the evidence with monkeys to indicate that strong 
drive is quite as likely to inhibit learning as to facilitate it. These criti- 
cisms, coming from a physiologically-oriented psychologist, represent one 
of the extreme reactions to the position that shortly before was being so 
widely endorsed. 

It is possible that Harlow has gone too far. Events are complicated, 
and possibly some drive-incentive conditions are appropriately treated 
according to the drive-reduction pattern while others may not be. Fur- 
thermore, boredom may arouse a tension state that is relieved by any 
change in the environment, so that satisfying curiosity may be boredom- 
relieving, and thus drive-reducing.** 

Montgomery has performed a number of experiments on the ex- 
ploratory drive in rats, a drive closely related to curiosity. While explora- 
tory drive interacts with other drives such as hunger, thirst, and fear, it 
cannot be explained by them. Novel stimuli, for example, may arouse 
both fear and exploration, but after the prolonged exposure has reduced 
fear the exploratory drive may persist. Montgomery's hypothesis is sim- 


ply that novel stimuli arouse the exploratory drive that motivates ex- 


ploratory behavior.** 
A position like Harlow’s and the related one of Montgomery’s re- 


ceives some support from Woodworth (1947), who recognized that a 
motive to perceive was a prevalent one, and from Nissen (1954), who 
accepts a “biogenic drive to explore, to perceive, to know.” Another way 
that Nissen puts it is that capacity is its own motivation; an organism 
goes about functioning in the ways it is prepared to function.*7 

Still another support from the camp of physiological psychologists 
for a more positive interpretation of what happens at the goal comes 
from the experiments of Olds and Milner (1954) on positive reinforce- 
ment through electrical stimulation of part of the rat’s brain, the septal 
area below the corpus callosum and between the two lateral ventricles. 


ae age 25. 
a5 Cai cree nt be expected, Miller’s reply to Harlow, e.g., Myers and Miller 
> 953). 

UE E E Troin t 953a) (1953b) (1953c), Montgomery and Monkman 
(1955). i 

87 The amazing disturbances in hum 
stimulating environment shows something o 
ment, Bexton, Heron, and Scott (1954). 


an subjects restricted experimentally to an un- 
f this need for interplay with the environ- 
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Electrodes permanently imbedded could be made to deliver an alternat- 
ing current to this area if the rat pressed the lever of a Skinner box. The 
“shock” was found to be positively reinforcing. That is, the rat would 
press the lever as if being rewarded, provided the current was turned 
on. When the current was turned off, and pressing the lever had no ef- 
fect, typical extinction ensued. (Other areas of the brain can be found 
mildly rewarding, some neutral, some punishing. ) * 

The experiment demonstrates that it is possible to reward an animal 
without satisfying a physiological need or withdrawing a noxious stimu- 
lus. Thus reward appears to be separable from need and drive. Further 
work along these lines may prove helpful in determining just how re- 
ward and punishment work. At this point the significance of the find- 
ing is that incentives may have properties requiring their definition 
independently of need and drive. It is possible that the role of Olds’s elec- 


trical stimulation was that of affective arousal, that the stimulation 
produced a state that the rat found “pleasant.” 


If so, the study gives some 
physiological support for the 


theory next to be considered. 
4. A clear statement of the positive role of incentives, and a re- 
interpretation of the motivational sequence, has be 
Clelland and his collaborators.*° It is a theory of affective arousal, a 
pleasure-pain theory, thus returning, in some sense, to a “law of affect” 
rather than a “law of effect.” It cannot be dismissed simply as old- 
fashioned hedonism, for there is a genuine effort to deal with the problem 
of objectifying affective arousal. A motive is defined as “the redintegra- 
tion of a cue by the change in an affective situation.” 
definition, published earlier, defined a motive as “a s 
sociation, characterized by an anticipatory goal reaction and based on 
past association of certain cues with pleasure or pain.” # 

McClelland’s criticism of drive-reduction theories of motivation is 
of what we have called above their negative character, that is, their fail- 


en given by Mc- 


"* A less cryptic 
trong affective as- 


38 Delgado, Roberts, and Miller (1954) have confirmed with cats | i 
demel iat of parts of the tectal area, thalamus, and hiere se bee e 
mechanism appeared to be the arousal of fear (or pain). Pus, 

39 Olds (1955) has examined some of these possibilities, 

40 McClelland (1951), McClelland, Atkinson, Clark, and Lowell (1953), Atkinson 
(1954). : 


#1 McClelland and others (1953), page 28. 
* McClelland (1951), page 466. 
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pects of motivation. That is, he returns to the common-sense notion that 
we seck activities that are pleasurable because we enjoy the pleasurable 
state that they excite in us, and we dislike and avoid other activities 
precisely because they are annoying or unpleasant to experience. Hav- 
ing taken this general position, he is not willing merely to turn back the 
hands of the clock to an earlier hedonism. Instead he is attempting to 
build a new theory on scientific props of the same order of objectivity 
and validity as those supporting the drive-reduction theory. One of the 
facets of his theory is that slight changes in intensity of stimulation tend 
to arouse pleasant affect, extreme changes unpleasant affect. 

The main point is that McClelland has substituted affective arousal 
for drive reduction, and all motives are said to be cued by changes in 
affective situations. Furthermore, all motives are learned. The two mas- 
ter motives recognized by McClelland are appetite and anxiety. Con- 
tinuity with the prevailing theories is there, so far as anxiety is concerned, 
but added to it are the positive motives summarized under appetite. 

However the many suggestions and counter-suggestions will finally 
ure that the need-drive-incentive pattern will not 
emerge in uncomplicated form, Some satisfactory solution will have to 
be found for the treatment of positive wants and desires that impel men 
to heroic and daring action. Only a few men see in the passive Nirvana 


state the true bliss for which they long. 


come out, we may be s 


Summary of Reinforcement and Drive 


The confident espousal of a need-drive-incentive conception of mo- 
tivated behavior, and of a drive-reduction theory of reinforcement has 
given way toa searching analysis of the circumstances under which learn- 
ing occurs, whether through contiguous stimulation, drive reduction, 
stimulus-intensity reduction, some combination of these, or through some 
process as yet not understood. Secondary motivation and secondary rein- 
forcement studies are very important in these analyses. 

The most significant new direction in the study of motivation is away 
from tension-reduction interpretations to an emphasis upon the positive 
aspects of behavior related to preferred incentives or activities. 


SUPPLEMENTARY READINGS 


A list of supplementary readings on current developments is provided fol- 


lowing Chapter 13. 


Chapter 13. 


CURRENT DEVELOPMENTS: 
II. SOME PERSISTING ISSUES 


THE issues around the need-drive-incentive pattern, as discussed in the 
T preceding chapter, are all central to stimulus-response theories. The 
quarrels between stimulus-response theories and cognitive theories are 
somewhat more clearly represented in other issues. One of these, which 
experimenters have actively investigated for twenty years, has to do 
with the nature of discrimination learning in animals. This issue has been 
selected for review because it is not only central, but currently alive, 
Stimulus-response psychologists have often shown impatience with 
cognitive theorists, charging them with 
of their theories. We shall therefore co 
up the deficiency in formal statement 
Finally, 
contribution 
the current d 


lack of precision in the statement 
nsider some recent attempts to make 
of cognitive theories. 

there has been a renewal of interest in the old problem of the 


of neurophysiology to learning theory. This is the third of 
evelopments selected for review, 


DISCRIMINATION LEARNING IN ANIMALS 

Although discrimination learni 

ject of laboratory study, new interest in the problem developed after 
Köhler (1918) (1925) cal 

relational discrimination 


pened by Spence in two impor- 
tant papers which have led to a considerable literature of experiment 
and theory. 


The first of these * took up the problem raised by Krech, and showed 


48 Krechevsky (1932a) (1932b), 
44 Spence (1936). 
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how reinforcement and extinction could account for the non-random 
features of presolution learning. This paper led to distinction between 
two viewpoints that have come to be known as the continuity and non- 
continuity theories of discrimination learning. Continuity theory accounts 
for the present in terms of past associative learning. Non-continuity 
theory predicts sharp breaks in learning as the animal learner shifts from 
one systematic mode of behavior to another, or achieves insight. The 
continuity theory may predict sharp breaks also, under specified condi- 
tions, but they are always predictable on the basis of the prior experi- 
ences of the learner. Spence was on the side of continuity, Krech on the 
side of non-continuity. 

The second of Spence’s theoretical papers ‘* was concerned with the 
issue of discrimination based on the absolute versus the relative proper- 
ties of the discriminated stimuli, as brought to the fore by Kéhler’s trans- 
position experiments. Spence showed very ingeniously how the overlap- 
ping gradients of generalization of excitatory and inhibitory tendencies 
resulting from reinforcing responses to the positive member of the stimu- 
lus pair and extinguishing responses to the negative member could ac- 
count for transposition with similar stimuli. 

The nature of Spence’s analysis is illustrated by Figure 64. The tests 
ade after the gradients in the figure have been 
set up through reinforcing responses to 160 (the positive stimulus) and 
extinguishing responses to 100 (the wa stimulus), Transposition 
can be tested by selecting any pair of neighboring stimuli. If response is 
then to the larger of the two stimuli, transposition is demonstrated 
because the original training required response to the larger of the stimuli. 
Suppose we select the pair of stimuli 160 and 256. We learn from the 
diagram that the response tendency to 160 has a strength of 51.7 and to 
256 a strength of 72.1. Hence the tendency to respond to the larger stimu- 
lus (256) is greater than the tendency to respond to the smaller stimulus 
(160), even though the smaller (160) was the positive attains during 
training. Thus the fact of transposition is accounted for without any as- 
sumption of reaction to the relationship larger-than. é : 

Spence’s theory has the virtue over Köhler’s Het it can also predict 
the known failure of transposition with extreme stimuli. Suppose stimul 

selected as test stimuli. Transposition (reaction to 
409 and 655 a equire response to 655. But we learn from Figure 64 
ae i oi to 655 is 18.5, while the tendency to respond 
to 409 is 52.1. Thus transposition will break down, and response will 


be.to the smaller of the two stimuli. 


46 Spence ( 1937). 


for transposition are mM 
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Because Spence’s gradients were generated entirely on the basis of 
the absolute (specific) characteristics of the individual Stiraoli, he set 
himself against theories of discrimination based on relative characteris- 
tics.*® 

These two problems (continuity versus discontinuity; absolute versus 
relative discrimination) are closely related, but they are sufficiently 


separable for us to consider independently the status of the issues they 
represent. 


39 62 100 160 256 409 655 
2 p 


STIMULUS SIZE =- SQ.CHS. 


FIGURE 64. Spence’s theory of transposition (Spence, 1942). The visual stimuli 
to be discriminated in either training or testing are scaled along the baseline accord- 
ing to their areas, 

Reinforcement of responses to stimulus 160 during training produces the gener- 
alization curve represented by the solid line. That is, the tendency to respond is 
increased not only for responses to stimulus 160 but to neighboring stimuli in pro- 
portion to their proximity to 160. 

Non-reinforcement (extinction) of responses to stimulus 100 produces the corre- 
sponding inhibitory gradient represented by the broken line. 

The tendency to respond positively to any stimulus along the baseline can be 
computed by subtracting the Tik of the dotted vertical line from that of the solid 


vertical line at that position along the baseline. The results of this computation are 
indicated by the numerical values above each stimulus size. 


The text explains how the diagram is used to explain transposition, 


Current Status of the Continuity and Non-continuity Issue 


The original paper of Spence (1936), denying the discontinuity be- 
tween presolution performance and solution learning, has been followed 
by many experimental pro’s and con’s, of which only the more recent 
will be reviewed here.‘ 


Ehrenfreund (1948) conducted two experiments, the data of the 
first appearing to support the non-continuity theory, while the second 
supported the continuity theory. In the first experiment, rats were ulti- 


+6 He later granted that some circumstances might force discrimination on the 
basis of relative characteristics (Spence, 1952c), but only if the relationships could 
not be learned more readily on the basis of the absolute characteristics of the com- 
ponent patterns. In this connection, see Nissen (1953). 

‘7 A few of the more important earlier references, which cite the other literature, are: 
Krechevsky (1938), McCulloch (1939), Lashley (1942), Spence (1940) (1945). 
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mately required to learn to discriminate between an upright and an in- 
verted triangle, having first been given 40 trials in which responses to 
the (later) negative stimulus were rewarded. They learned the ultimate 
problem as rapidly as a control group whose prior practice was not con- 
tradictory. Hence there appeared to be some discontinuity between the 
presolution learning and the ultimate discrimination. In the second ex- 
periment, however, the experimental group was retarded in its later 
learning by its earlier contradictory practice. The apparent paradox was 
resolved by Ehrenfreund by pointing out that the arrangements of the 
first experiment precluded discriminably different retinal excitations from 
the later-to-be-discriminated stimulus cards during the presolution period. 
The later-to-be-discriminated stimuli appeared on the upper portion of 
the cards, and the animals during the presolution period were forming 
position habits without “paying attention” to the figures. The second 
experiment, by contrast, required attention to the cards, because the jump- 
ing stand was raised and the rat jumped against the later-to-be- 
discriminated figures during the presolution period. Under the circum- 
stances of the second experiment the reinforcements during the presolu- 
tion period modified the later discrimination learning, as continuity 
theory would predict. Ehrenfreund concluded that his findings as a whole 
supported the continuity theory. 

The implied importance of “attention to the stimuli” has been given 
further study by Lawrence (1949) (1950), who makes a distinction be- 
tween “relevant” and “irrelevant” stimuli, and who believes that his results 
mediate some of the differences between continuity and non-continuity 
theories. A relevant cue is one consistently correlated with reward or 
punishment, whether alone or in combination with other cues. Thus 
Ehrenfreund’s cues in the presolution period were relevant ones because, 
for the experimental group, only one of the pair of figures was ever re- 
An irrelevant cue is one that is presented inconsistently, some- 


inforced. à i 
and sometimes with punishment. The stimulus cards 


times with reward 


during presolution were 
ind that when an animal learns to discriminate between a 


pair of cues in one situation there is transfer to a new situation, even 
different instrumental responses are required. The cues 
ness, SO that new responses to them can be learned 


irrelevant cues in Ehrenfreund’s control groups. 
£ 


Lawrence fou 


though entirely 


acquire distinctive : 
48 These new responses include reversals of habit, so that 


more readily. j ? 5 
ntinuity theory is not applicable. Lawrence's inter- 


an oversimplified co! 
48 Lawrence has really restated and given new evidence for Robinson’s law of 
acquaintance ( Robinson, 1932a, page 118). 
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pretation is in stimulus-response terms, but implies a aan an 
learned to the cues. This mediating response makes the cues distinc 
so that a new association is more readily established between them and 
instrumental responses. Lawrence and Mason (1955) have reported a 
somewhat elaborately designed experiment involving three sets of cue- 
dimensions (black-white, high and low hurdles, and spatial position). 
They showed that with two cue-dimensions present, one may remain non- 
functional while the animal is responding to the other; when the first has 
become relevant again, the discrimination is picked up as though the 
reinforcement and non-reinforcement of these cues did not matter while 
the other cues were relevant. When three dimensions were involved, 
there was a tendency to fall into systematic position preferences when 
the cue relationships to reward were changed. The authors suggest that 
the greater the number of irrelevant stimulus aspects during habit re- 
versal, the greater the likelihood of systematic runs to dimensions that 
do not lead to solution. 


As so often happens after the heat of controve 


rsy cools, the at- 
mosphere of 


debate is changed to one of search for the conditions under 
which various experimental results are found. 


*° Tt often turns out that 
the original issue was a somew 


hat artificial one. The suggestion that cues 
may acquire distinctiveness through prior experience 
pects of continuity theory, but also moves the theory somewhat closer 
to the claims of the non-continuity theorists. 

Another suggestion, bearin 
problem, has been provided by 


how-to 


supports some as- 


g on the continuity and non-continuity 
the experiments of H 
-learn” or what he calls “learning sets, 


with monkeys that, after practice, discrimin 
following a single trial. Because this “ 
gradually, Harlow believes he has d 
ship between insight and prior expe 


The field theorists, 


arlow on “learning- 
” so He finds in experiments 
ation reversals can be made 
“insightful” performance is achieved 
emonstrated a “continuity” relation- 
rience. He writes: 


unlike the Neo-behaviorists, have stressed insight and 
hypothesis in their description of learning, The impressi 


{ on these theorists give is 
that these phenomena are properties of the innate organization of the individual. 
If such phenomena appear independently of a gradual learning history, we have 
not found them in the primate order.’ 

While this statement of Harlow’s somewhat di 
lem, which has to do with how experience is uti 


the non-necessity of experience, his experiments 


storts the insight prob- 
lized rather than with 
are useful additions to 
*® See, for example, Blum and Blum (1949), 


50 E.g., Harlow (1949). See also Bateson (1942) on deutero-learning, 
to learn.” 


as a “learning 
51 Harlow (1949), page 65. 
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the literature on continuity and the modifications brought about by 
prolonged experiences with related problems. 

As an illustration of the trend to treat the discrimination problem 
on the basis of experimental variation, and at a relatively non-controversial 
level, Spence’s mathematical study of the effects of drive level and de- 
layed reinforcement on several varieties of discrimination learning pro- 
vides a good case.” 

He is able to show by a manipulation of his intervening variables 
that sometimes drive level will be important in discrimination, some- 
times not. For example, if, by some kind of forcing, responses to the posi- 
tive and negative stimuli are kept equal, discrimination should be inde- 
pendent of drive level: 


E, =Hx (D+K) (1) 
E_=(HxD)-I (2) 
E,—E_=kKH+ I, the D’s cancelling (3) 


In free responding, habit strength to the positive stimulus will gradu- 
ally exceed that to the negative stimulus. Then we have: 


E, =H, x (D +K) (4) 
E_=(H_xD)-I (5) 
Ey, —E_ = D(H, —H_) + (H4K +1) (6) 


According to Equation (6), the influence of drive upon the dis- 
crimination will depend upon the stage of practice, The relationship can 
become quite complicated. Spence points out how important it is to 
specify the initial and boundary conditions, as well as the laws and 
hypotheses of a theory, when that theory is applied to a particular set 
of experimental relationships. 

Spence (1955) has been very successful in applying his quantitative 
approach to a variety of discrimination data. The analysis of the two- 
bar problem, when responses to one bar are reinforced twice as often 
as the other, serves as @ convenient illustration. The apparatus used in 
the experiment was similar to that used by Logan (1952), and the data 
were unpublished from Ramond (1954). This discrimination problem 
does not involve non-reinforcement (extinction ), so that it is conveniently 
studied according to D and K, without the complication of I (inhibi- 

in free-choice trials, the bar whose re- 


tion). What happens is that, 
sponses have been more frequently reinforced (hereafter called the more 


frequently reinforced bar) is chosen more often than the bar whose re- 


52 Spence (1955), lecture 7. See also Spence (1954a). 
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sponses have been less frequently reinforced, up to a maximum discrimi- 
nation. But as the training continues, reinforcements on the favored bar 
no longer add the increment that reinforcements on the less frequently 
reinforced bar do, so the responses to the second bar “catch up,” and the 
discrimination becomes poorer. Responses to the more frequently rein- 
forced bar under two levels of drive are given in Figure 65. Note that 
under both levels of drive the maximum is at the same stage of train- 
ing. This agrees with the theory, in that the maximum at any one drive 
level is determined by the differences in growth rates of H to the two 
bars, which is a function solely of the number of reinforcements. 
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Spence designates the maximum value (here 36) as Ny. 
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out for this problem (where one bar is reinforced twice as often as the 
other) to be a function solely of i, the coefficient of the exponent used in 
plotting the acquisition curve: 


2 log 2 (1) 


Because Na has been determined empirically from the data (Na = 
36), it is possible to compute i, which turns out to be .0167. Hypothetical 
growth curves, using this exponent, can be drawn for responses to the 
more frequently reinforced bar under the two conditions of drive, as 
shown in Figure 66, using the value of i determined from Na. 


=.O167N 
E- 88(1-10 


POINT OF MAXIMAL DIFFFERENCE (Na) 


S 


EXCITATORY POTENTIAL 


0 48 56 64 72 80 88 96 104 li2 120 128 136 144 152 160 


9 @ 6 24 32 4 
NUMBER OF RESPONSES TO BAR A 


FIGURE 66. Reaction potential to the more frequently reinforced bar under two 
conditions of drive (theoretical, but based on Na = 86). After Spence (1955), Lec- 


ture 7. 


The theory can now be put to a severe test. Will this i, determined 
from these data, fit latency data obtained in the forced trials to the more 
frequently reinforced bar presented alone? Latency is first converted to 
running speed. Then using the relationship between E and running 
speed, as previously determined theoretically (Spence, 1954b), and plot- 
ting Ramond’s data accordingly, an acquisition curve with the exponen- 
tial coefficient i = .0167 turns out to fit satisfactorily. This then meets 
the desire for stable interchangeable constants pointed out in the chapter 


on mathematical models. 
P a. OENE AET 
This mathematical extension of Spence’s discrimination theory re- 
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mains a continuity theory, in that separate equations are written for re- 


sponses to the positive and negative stimuli, and no discontinuous emer- 
gent is anywhere implied. 


Current Status of the Issue Whether Discrimination Is 
Based on Absolute or Relative Properties of Stimuli 


Spence’s second paper (1937) contained the careful 
intersecting gradients of generalization and was therefore 
form well subject to testing. Again a substantial liter 
which only the most recent papers will be reviewed he. 
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learning. Because of its novelty and potential fruitfulness it is worth de- 
scribing in some detail. In this experiment the animal learner always 
faced two identical cards exposed in the Lashley jumping apparatus. 
Each card was divided into an upper and lower half. The lower half 
during training always consisted of an intermediate gray (No. 4 in a 
scale from 1 to 7, with 1 being lightest). The upper half was either No. 
1, 2, or 8, lighter grays, or 5, 6, or 7, darker grays. The animal had to 
learn to jump to the right if the top half was lighter than the bottom half, 
and to the left if the top half was darker than the bottom half. Note that 
it would conceivably be possible to ignore the bottom half of the card. 
It was entirely indifferent, so far as reward and punishment were con- 
cerned. Furthermore, the top halves of the cards could have been 
learned according to their absolute brightness, for 1, 2, and 3 were al- 
ways reinforced when jumping was to the right, and 5, 6, and 7 when 
jumping was to the left. Thus the situation is not one that forces rela- 
tional learning, although it of course permits it. 

We may raise the question: When the rats solved this problem (as 
they did), was it on the basis of absolute or relative stimulus character- 
istics? The experimenters went on to a test with the grays in different 
combinations. Consider the simple expedient of placing the cards up- 
side down, with No. 4 gray always on top. Now with the lighter Nos. 1, 
2, or 8 on the bottom, the top card is darker, and so jumping should be 
to the left (if cues are relational) while jumping should be to the right 
if cues remain absolute. Correspondingly the darker Nos. 5, 6, 7, if on the 
bottom, make the top card relatively lighter, so relative response would 
lead to jumping to the right, while absolute cues would suggest jumping 
to the left, This particular form of transposition test yielded 74 per cent 
of the jumps relationally determined, well beyond the .01 level of sig- 
nificance. Some of the transposition tests lead to the same prediction by 
both the absolute and the relative theories. For example, suppose gray 
l is placed above gray 3. Then the top is lighter, and both components 
have been associated with reward for jumping to the right. With such 
stimuli, 94 per cent of the jumps were in predicted direction. A third set 
of relationships, such as No. 2 over No. 5 and No. 6 over No. 8 makes pre- 
diction from the specific stimulus (absolute) theory ambiguous, for one 
stimulus component is associated with jumps to the right, one with 
jumps to the left. In such cases, instead of chance choices, 95 per cent 
were in keeping with the relational hypothesis.** 

These recent experiments show that responses to relationships are 


s level may also be accounted for on another basis. Be- 
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clearly possible under circumstances in which specific stimulus cues are 
minimized and relational cues emphasized. Again an old controversy 
dissolves in favor of a statement of the conditions under which one or 
another form of response occurs. Rats can react either to the absolute 
characteristics of specific stimuli or to stimuli in relation to each other. 
It is the circumstances that are important in determining which they will 
do. Spence (1952c) has accepted what is essentially this position. 

The mathematical treatment of discrimination 
favoring either absolute or relational discrimination 
theory (1937) was based on the absolute qu 
mathematical treatment by Gulliksen and Wo 
the relative qualities of the stimuli, 


does not require 
» While Spence’s 
alities of the stimuli, the 
lfe (1938) was based on 


e (1952) studied the transfer 
nd that learning was more 
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e training was given directly 
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ation learning, 
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eriments to the study of dis- 
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somewhat in mediating the problem of absolut i 
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nation by showing that, in general, transposition is easier as the chil- 
dren become older. Some importance is attributed to the development of 
language, but the causal factors are obscure and alternative hypotheses 
are possible. One curious finding **. is that a size-discrimination was much 
easier progressively from preschool years to the fifth grade, but then be- 
came more difficult up to the college years. The only plausible interpre- 
tation was that the older students were looking for a more difficult prob- 
lem and overlooked the very simple demands of the discrimination. 
This suggests (for human learning at least) theories emphasizing “set” 
or acquired cue distinctiveness over those based merely on the reward- 
ing and punishing of specific responses. 

These recent studies leave the problem of discrimination at a ma- 
turer level for analysis, beyond the stage of polemical debate in which one 
side or another scores a “victory.” There are systematic modes of solu- 
tion emphasized by the non-continuity exponents, and there are the 
cumulative results of experience emphasized by the continuity exponents. 
There are responses to specific stimulus components as advocated by 
those building their theories upon them, but there are also responses to 


stimulus characteristics defined in relational terms. 


THE FORMALIZATION OF EXPECTANCY THEORY 


Because of its more precise formal structure, stimulus-response theory 
has guided more research and theory construction in recent years than 
cognitive theory. Many of the problems set for stimulus-response psy- 
chologists, however, were posed by those from the other camp: for exam- 
ple, the distinction between learning and performance, latent learning, 
transposition, systematic modes of problem solution, secondary reinforce- 
ment (in the form of sign learning), incentive motivation (the valence 
of perceived incentives). As stimulus-response experimentation and 
theorizing has become more developed many of the challenging prob- 
lems which cognitive theorists first called attention to have taken on new 
interest for the stimulus-response theorists. For example, instead of deny- 
ing latent learning, new stimulus-response derivations are required to 
account for it; instead of denying relational discrimination, earlier theories 
are modified to include it. This is entirely wholesome, and in the end 


may lead to the kind of unification of psychological theory that we all 
wish to see. 

A few psychologists, ™ 
and McConnell (1955). 


any of them trained in stimulus-response ap- 


58 Stevenson, Iscoe. 
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proaches, have become increasingly aware that the cognitive theorists 
were talking about psychological reality, even though their formalization 
of certain phenomena was inadequate. Some of them have turned their 
attention to the formalization of cognitive theories in a manner more 
nearly consonant with the kind of systematic treatment that stimulus- 
response theories have been getting. 
The fact that something may not yet have been done does not mean 
that it cannot be done. Miller (1949) has repeated the invitation of Hull 
(1930) to those friendly to gestalt psychology 


to do for cognitive theories 
what has been done for stimulus. 


-response theory. Yet it must be remem- 
bered that little was done to formalize Thorndike’s theory between 1898 
and 1930. The motivations of psychologists have to be taken into account 


as well as the ripeness of their theories, Here lie some interesting prob- 
lems for the historian of science, 
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believed that they could 
ing. 

Olds (1954) set up a neural model for T 
Hebb’s (1949) discussion of cell assemblie: 
precise to allow for experimental testing, 
based on it has been reported (Olds, 1953) 


ging a preference for stimulus-response 
at applying Tolman’s theory to avoidance learn- 
postulates consonant with Tolman’ 
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used to define an association. The best way to remember the process is 
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process, prior to doing anything to the button right now, 
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If ringing the doorbell becomes a goal (becomes valenced), then the 
expectancy is activated. I push the button only if I want to ring the door- 
bell; I may have the expectancy ($,R,S,) without doing anything about 
it. 

Capital letters are used to describe the significant environmental 
events S,-R,-S.,, and the same letters may be used for the inferred con- 
struct, the expectancy (S,R,S,). The letters are enclosed in parentheses 
when reference is to the construct rather than to events themselves. But 
whenever capital letters are used, whether in describing events or the 
inferred expectancy, it is appropriate to use physical-stimulus language. 
That is, S, and S, are objects in the environment, with their ordinary 
characteristics, and R, is a physical response. The specification of the 
terms of an expectancy in physical language permits not only objective 
verification, but also clear statements about confirmed and unconfirmed 
expectancies. (The logic of the physical language is no different from that 
used in assigning dimensions to Hull’s sHn.) © 

Twelve postulates are stated, often with guesses as to the form of 
mathematical function involved. For a briefer introduction to the flavor 
of the system, the postulates are here paraphrased, with much of the 


quantitative detail omitted. 

1. Mnemonization. The occurrence of the sequence S,-R,-S, results in an 
increase in the strength of an expectancy (S,R,S,). The rate of growth increases 
with the valence of S,. The limit of growth depends on the probability (Py with 
which S, regularly follows S,-R,. That is, growth approaches P as a limit. 

This is the basic acquisition postulate. It is a contiguity theory, 
in that mere occurrence of the sequences presumably leads to increase 
in expectancy. But it is also a kind of reinforcement theory, because the 
expectancy is influenced by the valence of the terminal item (S,) in the 
sequence. It is not, however, a stimulus-response theory in the usual 
sense, MacCorquodale and Meehl point out that by their accepting speci- 
fied parameters for rate and limit of growth they have not made conces- 
sions to some form of non-expectancy theory. 


2. Extinction. The occurrence of a sequence S,-R,, if not followed by S,, 
tends to produce a decrement in the expectancy. 

Two cases have to be distinguished. If P has been 1.00 (that is, 
S,-R, always followed by S+) a single omission of S, will lead to a 


dale allow for a central state called the expectant (sins), 
et ee he their preliminary formalization. Presumably if one were 
to faeces neurophysiological explanations there would be some translations between 
the neurophysiological language of (siis:) and the physical language of (S,R,S:). 
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prompt decrement, depending both on the current strength of (S,R,S,) 
and the valence of S,. If, however, P has been less than 1.00, the course 
of decrement will be slower, and, if P shifts to a lower P’, the expectancy 
will approach the new value of P’ as an asymptote."° 


3. Primary-stimulus generalization. When an expectancy (S,R,S,) has been 
strengthened, other expectancies (S’,R,S,) will also have received some strength, 
depending upon the similarity of the elicitors S’, and S,. 


Similarity gradients-are an empirical matter, and there is no point in 
trying to distinguish the functions from those used by stimulus-response 
psychologists. Tolman (1939) recognized this when he built Spence’s 
(1987) gradients into his “schematic sowbug.” 


4. Inference. When an expectancy (S,R,S,) exists at some strength, the 


presence of a valenced object S° in close contiguity with S, gives rise to a new 
expectancy (S,R,S°). 


5. Generalized inference. Even though (S,R,S,) is originally of zero 
strength, the occurrence of a valenced object S° in close contiguity with S, may 
give rise to a new expectancy (S,R,S°), provided (S,R,S’,) is of some strength, 
and S’, is similar to S, j ü 


These are the kinds of postulates said by MacCorquodale and Meehl 
to contribute heavily to the identification of an expectancy theory. They 
give the basis for latent learning and for the interpretation of experi- 
ments modeled after Maier’s reasoning experiments. 

The next six postulates have to do with motivation, described ac- 
cording to needs, cathexes, and valences, I am taking the liberty of re- 
numbering them for expository purposes. They can easily be identified 
in the original source because their titles are unchanged. The three fol- 


lowing postulates state the fundamental position with respect to motiva- 
tion. 


6. Cathexis. The cathexis (C°) of a valenced stimulus situ 


z A ation (S°) i 
function of the number of contiguous occurrences between it Nd 


and i 

matory response. the consum: 

T. Need strength. The need (D) for a cathected stimulus (SA ikam io 
creasing function of the time interval since satiation for it, 


8. Valence. The valence of a stimulus S° is a multi plicative f F 
correlated need D and the cathexis C° attached to S°, £ unction of the 


60 We know from Humphreys’ (1939b) experiment that followin 
may be an incressing level of (S:R,S:) for a few trials if S, is regu 
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Postulates 6 to 8 imply a fairly complex theory of motivation, gen- 
erated (as Spence’s recent theory was) by the problems of latent learn- 
ing. The sight of food leads to food-seeking because of the correlation of 
sight of food with consummatory response (eating). This is the meaning 
of cathexis. Food thus gains a certain attractiveness (cathexis) through 
prior experiences of eating, but its present pulling power as an incentive 
(its valence) depends also on how hungry the organism is (its need 
level). Need is a function of deprivation; cathexis is a function of ex- 
perienced satisfactions in the past; valence is a product of both. There 
is a striking parallel with Hull and Spence’s D and K, which act together 
(as a product or as a sum) to influence reaction potential. 

Now we are ready for some postulates (actually earlier in the origi- 
nal list) concerning some further relationships between experience and 


cathexis. 


9. Secondary cathexis. The contiguity of S, and S° (a valenced stimulus) 
increases the cathexis of Sa. 

This is a kind of “secondary reinforcement” or acquired incentive 
value, whereby the valence of one expectandum is given to a contiguous 


expectandum. 

10. Induced elicitor-cathexis. The acquisition of valence by an expectandum 
S, belonging to an existing expectancy ($,R,S,), induces a cathexis in the elici- 
Ti Confirmed elicitor-cathexis. The confirmation of an expectancy (S,R,S.), 
when S, has a positive valence, increases the cathexis of S,. 

These two postulates mean that an elicitor, as a discriminative stimu- 
lus, comes to have reinforcing power. It is easily demonstrated that cues 
if similar to cues in the goal-box, facilitate learning. 
d according to secondary reinforcement, the no- 
tion of acquired elicitor cathexis is roughly equivalent.®? 

The difference between “induction” and “confirmation” is that be- 
g food after satiated runs, and learning while hungry. 


at a choice-point, 
While usually interprete 


tween introducin 


e reaction potential sE, of a response R, in the presence 


12, Activation. Thi p 
function of the strength of the expectancy (S,R, S,) and 


of S, is a multiplicative 
the valence of the expectandum. 
This final postulate gets around the jibe of Guthrie (1952, page 143) 


that Tolman leaves the rat buried in thought. The parallel with the Hull- 


Spence basic formulation is striking: 


62 Bauer and Lawrence (1953) show that the problem of choice-point cues and 
goal cues is complex. Conditioned inhibition may turn out to be as relevant as sec- 


ondary reinforcement. 
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Hull-Spence 
sEr = f (sHr) X f(D, K) 
MacCorquodale-Meehl 
sEr = f(S,R,S,) x f(D, C°) 


As one goes on to specify in more detail the relations between re- 
action potential and response, there remain the problems of thresholds, 
oscillation, and the like. MacCorquodale and Meehl point out that there 
is no reason to handle these further matters differen 
model and the stimulus-response model. 

How successful are these postulates in deducing the experimental 
phenomena of learning? MacCorquodale and Meehl (1953) 
the procedure by deducing the phenomena 
representing subvarieties of latent learning. 
well in accounting for the data through mani 

The postulate set is avowedly we: 
of particular interest to 


tly in the expectancy 


illustrate 


asis on “maps” 
eful beginning in showing, first, how a theory 
al reference to Te- 
expectancy setting, and second, when m 


ge areas of agreement between a theory of this kind 
and stimulus-response theory. 


ade explicit, 


Rotters Social-Learning Theory 


Rotter (1954), in a book designed to provide a theory for clinical 
psychology, has evolved an expectancy-reinforcement theory with many 
points of resemblance to the MacCorquodale and Meehl model, 

His basic formula for behavior potential (B.P.), the equivalent of 

» makes it a function of ex 


That is: 


BP. = f(E, RV.) 
It will be recalled that in other sy: 
monly has’two components, one reflectj 


> » the other reflect- 
ing goal-activity or incentive value. Rotter’s theory concentrates upon 


63 The experiments are those of Kendler (1946), Meehl and MacC 
and Tolman and Gleitman (1949). acCorquodale (1948), 
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the incentive value, and the reinforcement value (R.V.) is measured 
directly by the preference for one goal-object over another when the 
possibilities of occurrence are equal. Something similar to Allport’s func- 
tional autonomy of motives is implied: social goals are learned ones, and 
they need not be referred back to physiological needs. Instead, present 
reinforcements are best understood in relation to prior reinforcements: 


We find it a sufficient basis for prediction to state that behavior directed to- 
ward the attainment of a learned goal or external reinforcement may be predicted 
through a knowledge of the situation the organism is in and from a knowledge of 
his past learning experiences. It appears as if a theory that attempted to predict 
meaningful behavior through description of the organism’s physiological drive 
condition at a particular moment would hold not only no advantages but many 


disadvantages."* 

The theory has a rather different flavor from the familiar ones be- 
cause of its social, hence more molar, reference. For example, the list of 
six major needs brings newcomers to learning theory: recognition-status, 
protection-dependency, dominance, independence, love and affection, 
physical comfort.” 

Another new concept is that of freedom of movement, which is re- 
lated to psychological need. It is defined as: The mean expectancy of 
obtaining positive satisfactions as a result of a set of related behaviors 
directed toward the accomplishment of a group of functionally-related 
reinforcements.** What this means is that high freedom of movement 
implies an expectancy of success for many different kinds of behavior in 
many different situations related to a family of common goals. Low free- 
dom of movement, of course, implies the opposite. 

Rotter’s theory has been subjected to a number of experimental 
tests in relation to its quantitative assumptions, but few of the results 
have yet reached publication, so that critical appraisal in relation to 
other learning theories must be postponed.” It is a significant effort at 
theory construction, however, and should be seriously considered because 
of its attempt to broaden the boundaries encompassed by our present 


quantitative theories. 


64 Rotter (1954), page 116. 
65 Rotter (1954), page ka 
66 Rotter (1954), page 194. i zi 
Q ea pe E peared at the time of writing, e.g., Crandall (1951), Schroder 
and Rotter (1952), Rotter, Fitzgerald, and Joyce (1954). Other previously unpub- 


lished dissertations are appearing currently. 
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NEUROPHYSIOLOGICAL INTERPRETATIONS 
OF LEARNING 


Among the systematists whose views are presented in Chapters 2 
through 10, only Thorndike, Köhler, and Freud gave any appreciable at- 
tention to the neural correlates of their theories. Thorndike’s early “syn- 
aptic resistance” theory seemed no longer important to him in his 1 
years. Freud’s influence has been functional rather th 
Only Köhler has continued to attempt to find 
his principle of isomorphism. 


ater 
an neuroanatomical. 
physiological support for 


This relative lack of interest by learning theorists in the nervous 
system, particularly in the period of 1930 to 1950, is again a problem for 
the historian. No one denied the importance of the nervous system in 
learning. The denial had to do only with its importance for theories of 
learning. Possibly one influence was a certain disillusionment about 
progress in neuroanatomy and neurophysiology after the bre 
of the localization doctrine under many lines of evidence, including 
Head’s studies of aphasia and Lashley’s cortical ablation studies. If no- 
body really knew anything about how the brain worked, what help could 
present neurophysiology be to the learning theorist? So the argument 
ran. Perhaps another influence was the tendency for objective ‘studies 
to favor “peripheral” rather than “central” interpretations. Guthrie and 
Hull were making the case for fractional anticipatory responses and 
movement-produced stimuli; g that we stick to the 
data and avoid the cn.s, (“conceptual nervous system”). When even a 
“centralist” such as Tolman had little interest in the nervous system, it 


was not surprising that the non-physiological position temporarily won 
the day. 


A reaction has now begun to be felt, and the rel 
system to learning is awakening renewed interest. Although, to be sure 
the interest had never fully died out, the renewal of widespread eves 
among the theorists can be dated by the appearance of Hebb’s (1949) 
neuropsychological theory. Hebb brought a freshness of approach capi- 
talizing on recent developments in neurophysiology a Wis 
some daring speculations in a non-polemical manner that invited tractable 
consideration. Two specific events that marked the reaction away from 
the non-physiological period are some remarks in a discussion by Tolman 
(1949c) pleading for a more explicit neurologi 
quent insistence by Krech (1950) 
construct in psychology in purely 


akdown 


Skinner was arguin 


ation of the nervous 


nd introducing 


neurological terms, 
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The Experimental Contributions of Neurophysiology to an 

Understanding of Learning 

A great many experiments have been devised through the years in 
an effort to locate the parts of the brain essential for carrying out learned 
activities. While results of interest have been secured, they have not led 
to any significant generalizations about the precise mechanisms of learn- 
os 
Although the new interest in neurophysiological theories of learn- 
ing also has thus far led to little that is of genuine explanatory value, a 
number of leads have been opened. Without attempting to appraise the 
vast literature, we may mention a few of the developments as indicative 
of some of the directions in which research is moving. 

1. Electrical exploration of the brain of human subjects under local 
anesthetics (especially with epileptic patients) has located some sites, 
particularly in the temporal lobe region, where vivid memories appear 
to be re-enacted under stimulation.” 

2, The temporal lobes, particularly the amygdaloid nuclei, have be- 
come of special interest not only in studies of emotion, but also in the 
affective control of learning, Cats with these nuclei ablated required more 
trials to learn an avoidance response than cats with control lesions, but 


ing. 


their retention was unaffected." 

3. Recent work, in part using the electroencephalograph, has called 
attention to a diffuse system by which the midbrain affects the cortex 
other than through anatomically defined sensory pathways. This diffuse 
system operates via the reticular substance, and permits incoming sensory 


stimulation, acting at a subcortical level, to waken the organism and 


incite it to activity.” 

4, Working with amplifiers that can record resting potentials or 
slowly varying ones, it is possible to detect changes of potential on the 
surface of the skull at the occipital lobes corresponding in some ways to 
the patterning of visual stimuli. These results appear to give some sup- 
port for Kéhler’s field theory of excitation.” 

5. Imbedded electrodes worn while animals engage in learning are 


68 Useful summaries of the major findings may be found in reviews by Marquis 


(1951), Morgan (1951), and Osgood (1958). 


69 a 1954). 

70 ores on ioe eA}. The problem was opened up by Kliiver and Bucy 
(1939). For a review of temporal lobe function see Milner (1954). 

71 E.g., Lindsley and others (1950), Magoun (1952). 7 

72 kelkie and Held (1949), Köhler, Held, and O'Connell (1952), The theory sup- 


ported has been criticized experimentally by Lashley, Chow and Semmes (1951). 
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being used to study the effects on learning of stimulating various por- 
tions of the brain.** ia 

6. Some chemical differences are found in the brain tissue of rats 
high in “hypothesis” behavior as contrasted with those low in “hypothesis” 
behavior.”+ 

Interesting as these various leads are, they are mostly rather remote 
from the central problems of learning, although some of them be 
the problems of motivation and the nature of reinforcement, 


ar on 


Hebb’s Speculative Neuropsychological Model 


Because of its influence in renewing interest in neurophysiological 
speculation, Hebb’s (1949) theory deserves special mention," 

Hebb recognizes that the basic pr 
the something that intervenes between 
is not fully controlled by environment 
plain thought in neurological terms and of course to do so without an 
shade of animism. He has no truck with a little man in the head who 
wants or disapproves or guides. Hence, while Hebb wishes a th 
adequate to the phenomena of a cognitive psychology, 
ness in it, and so ties it as closely as possible to wha 


eory 
he wants no soft- 
t we know about 


in the telephone 
ution of potentials 
field theorists, 


erstanding cortical] 
ce. 


: i assembly, 
by sensory stimulation, or by both at once. 
The neuronic basis for a persistin 


8 reverberating circuit js specula- 
tive. Hebb suggests that stimulation 


repeated often enough to have a 
18 E.g., Olds and Milner (1954), Delgado, Roberts and Mill 
74 Krech and others (1954). iller (1954), 


75 No attempt is made here to review the many speculative model z 
peared, e.g., Ashby (1952), Coburn (1951) (1953), McReynolds (Issa) eag 
Wolpe (1949) (1953). 53), 
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lasting influence results in the growth of “synaptic knobs” that increase 
the area of contact between the axon and the tissue into which it fires. 
This comes only after repetition; the primary process creating the cell 
assembly depends on convergences of excitation in a rather diffuse 
neurone network, as a consequence of contiguous stimulation. Some 
“equipotentiality” is implied, because once a cell assembly is set up, 
fewer fibers have to be active at once in order to traverse the synapses 
necessary to arouse the cell assembly. 

A phase sequence is constituted by the arousal of a series of cell 
assemblies. For example, looking successively at the three corners of 
a triangle arouses the cell assemblies appropriate to each corner, and 
these facilitate each other. Thus the whole triangle may be perceived 
while attending to any one corner. Even the act of perceiving a triangle 
involves rather complex cortical facilitation, both sensory and motor; 
for example, we can anticipate (“expect”) the change that will come 
about by changing fixation from corner A to corner B. Thus the phase 
sequence is the thought process. 

At birth there is an intrinsic organization to cortical activity, indi- 
cated by the large slow waves of the infant’s electroencephalogram. 
For learning to take place, sensory processes have to interrupt this 
intrinsic organization and gain control of association-area activity. Hence 
the larger the association area (relative to the sensory areas), the more 
difficult it is for the young to learn. Hebb accounts on this basis for the 
slowness of learning by the human infant as compared with lower 
animals. 

The first stage of learning is the establishing of cell assemblies and 
their related phase sequences. At first this comes about slowly, and slow- 


increment learning is characteristic of childhood. But the results of such 


learning endure and are influential throughout life. 

Adult learning is essentially conceptual, often prompt and insight- 
ful. Such learning is possible, according to Hebb, only when the stimu- 
lation sets off well-organized phase sequences. 

This is the theory in broad outlines. Hebb’s book has been influential 
not only because of its neurophysiological ideas but because of the 
wealth of psychological observations reported in it. Unfortunately it is 


impossible to convey this richness of detail in a few paragraphs. 
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CONCLUDING REMARKS ON CURRENT DEVELOPMENTS 


The theoretical positions developed in the first half of the cen- 
tury have had their fruits not in the final triumph of any one system 
but in a more sophisticated approach to both experimentation and re- 
finement of theory. The vast volume of experimental work now appear- 
ing is almost all oriented to the checking of one or more hypotheses, 
and these hypotheses are imbedded in larger theoretical contexts. This 


kind of development tends at once to soften the clash 


es between sys- 
tematists as they 


attempt soberly to face common facts, and to improve 
fact-gathering by experimenters. The newer facts, oriented to more pre- 
cise theories, are more likely to be additive than they were in the past. 

We have reviewed the kinds of searching criticisms that are going 
on in current studies of reinforcement, of motivational aspects of learn- 
ing, and of the problems of learned discrimination. The efforts to sys- 
tematize “expectancy” interpretations come in recognition of the cogni- 
tive and directional aspects of learning, Neurophysiological speculation 
is again welcomed. These developments have not yet reached any cli- 
max, and no mediator has yet appeared to enlist agreement on some kind 
of harmonious interpretation of all the phenomena of learning. Yet prog- 


ress is being made, and there is reason to hope that we are moving toward 
the unification of theory. 
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Chapter 14 


RETROSPECT AND PROSPECT 


j ligen array of theories we have examined may be thought to present 
an unfavorable picture of the state of systematic knowledge about 
learning. How can psychologists be helpful to other social scientists or 
to those who wish to apply their findings if they disagree among them- 
selves on these fundamental matters? It seems to us that there are several 
considerations which temper this adverse judgment. For one thing, there 
is a great deal of empirical knowledge about learning which is unrelated 
to the differences among the major points of view. For another, there are, 
in addition to the general theories, a number of special theories, more 
closely related to particular experimental situations. These special theories 
are tested and corrected in a matter-of-fact manner, cutting across the 
preferences of rival theoretical positions. Finally, discipleship in one or 
another of the major schools is not characteristic of most psychologists 
working in the field of learning. There are strong tendencies toward medi- 
ating positions. Many writers have found the major contributions of the 
different schools largely reconcilable.* 

The disadvantages of conflicting points of view are balanced to some 
extent by the motivation which a strong position provides its proponent 
and by the challenge it presents to its adversary. Out of the heat of con- 
troversy there is eventually scientific advance.* Even in the midst of con- 
troversy, it is not necessary for the theorist to set aside reasonableness or 
critical powers. Theoretical differences show up most markedly where 
the data remain ambiguous, so that preferences hold sway over evidence. 
As the data become more securely established, it becomes increasingly 
possible to translate one system into andthe. 

While the state of knowledge is not therefore as bad as the parade 
f view makes it out to be, it is still rather unsatisfactory, There 


of points o A 
a laws of learning which can be taught with confidence. Even 


1 E.g, Dashiell (1935), Kellogg (1938), McConnell (1942), 


2 Boring (1929). ET 
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the most obvious facts of improvement with practice and the regulation 
of learning under reward and punishment are matters of theoretical 
dispute. 


TOWARD A MORE SATISFACTORY THEORY 


A generally satisfactory theory awaits a set of concepts which will 
be appropriate to all that is known about learning. While each major 
theorist believes his concepts to be the appropriate ones, their perti- 
nence often derives from the fact that the theorist’s interest is prim 
in some segment or aspect of the total learning situation. Some are i 
ested in the initial adjustment, as 
are interested in the predictabili 
ditions of motivation, and so on. We need a more careful deline 
the kinds of learning which take 
of its own), and an acceptable f 
which make demands upon theo; 
because they are concerned with 


arily 


nter- 


ory construction, based 
ationships, and in addi- 
onships into a system. 
take. In fact, it is useful 
truction just as there is a 
some of the needed direc- 
rk in any one of them: 


relationships. 


Because this is so, two possibilities remain open. One is 


e mediating concept remains 
as to be specified and meas- 
that derive from the opera- 


intangible, but it must be “triangulated” so 
ured. It must have units of measurement 
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tions giving rise to it. MacCorquodale and Meehl (1948) named this 
kind of mediating process an “intervening variable.” 

The other possibility is to give the intervening process tangible 
or substantial status, and then search for it directly. MacCorquodale and 
Meehl call this kind of tangible intermediary a “hypothetical construct.” 
The experiments concerned with the need-reduction theory of reinforce- 
ment are of this sort; they are not studying the place of reinforcement 
in the equations of Hull, but they are going at the substance of a rein- 
forcement postulate. Similarly, the direct attacks on drive in relation to 
food deprivation can be launched without worrying about drive as a 
multiplier of habit strength. The fractional anticipatory goal response 
is substantive and ought to be directly observable in appropriate experi- 
ments. A neurophysiological theory, provided it leads to concrete ex- 
periments, can give a substantive basis for the mediating processes in 
either the peripheralist or centralist theories. 

3. Greater inclusiveness. Spence (1953) has given us a diagram show- 
ing six levels of lawful relationships within learning data, to which corres- 
pond six first-level theories, close to these varieties of data. Above these 
in a logical hierarchy are second-level theories that co-ordinate the first- 
level theories, and then there may be more general theories at the third 
level. It is his contention that we must work up from the fields in which 
the data are best quantified, and where the interrelationships of vari- 
ables have been most thoroughly explored. At present, our most precise 
theories deal with conditioning, selective learning, and serial learning. 
Our first-level theories are very weak in the study of skills, problem- 


solving, and group learning. Spence’s diagram is reproduced as Figure 


67. 


We may accept Spence’s analysis as historically accurate, without be- 
ing entirely bound by it. Some theorists, for example, instead of waiting for 
the better lawful relationships to be established in the fields of skill, 
problem-solving, and ] ; 
help in the design of experiments that will result in studies revealing 
lawful relationships in these areas. Spence s diagram has the virtue that 
it does not prejudge the question as to which kind of learning is more 
fundamental, for the six kinds of learning are all at Level 1. The fact 
that the situations at the left are somewhat simpler, in terms of the 
control of experimental variables and the gathering, of data, need not 
imply that any more fundamental psychological pip cesses ote involved. 

The search for greater inclusiveness is one direction for theory to take, 


and this search does not have to wait. 


group learning, will wish to set up models to 


460 


THEORIES OF LEARNING 
4. A more exact applied psychology of learning. This scarcely 
sounds like theory at all, yet a high order of theoretical work will be 


necessary to arrive at a set of practical rules that will make general 
psychological laws of learning applicable to teaching and training. It 
is one of the challenging tasks confronting the psychology of learning, 
and able talents are needed for it. 
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These several directions of advance suggest how much there is to 
us. 


be done, and how defeatist it would be to wait for the “brilliant theo- 
rist,” the Newton or the Einstein who will solve 


all of our problems for 


SPECIAL PROBLEMS THAT LEARNING THEORIES FACE 


Next in our retrospect we turn to some of the typical problems that 
were used to summarize the several systems, and to some rel 
solve. 


ated ques- 
tions that arise as we consider the problems the theorists tried to 
Learning Capacities 


The extreme form of generalization from comparative studies is that 
there are no differences, except quantitative ones, between the learning 
of lower mammals and man. Hull stated this quite clearly: 

The natural-science theory of behavior being developed by the present au- 
thor and his associates assumes that all behavior of the individuals of a given 
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species and that of all species of mammals, including man, occurs according to 
the same set of primary laws. 

While such a position is more often implied than asserted, it is 
strange that the opposite point of view is not more often made explicit— 
that at the human level there have emerged capacities for retraining, 
reorganizing, and foreseeing experiences which are not approached by 
the lower animals, including the other primates. No one has seriously 
proposed that animals can develop a set of ideals that regulate conduct 
around long-range plans, or that they can invent a mathematics to help 
them keep track of their enterprises. Because a trained dog shows some 
manifestations of shame, or a chimpanzee some signs of co-operation, 
or a rat a dawning concept of triangularity, it does not follow that these 
lower organisms, clever as they are, have anything comparable to the 
richness of human mental activity. 

The problem of emerging types of learning ability must be pushed 
farther back. Language in man is perhaps the clearest of the emergents 
Which carries with it a forward surge in what may be learned. It seems 
plausible enough that other advances in the ability to learn must have 
come about as the nervous system evolved through successive stages be- 
low man, The prevailing learning theories have had remarkably little to 
say about the different processes which together make up the functional 
changes which are classified together as learning.‘ There are probably a 
number of different kinds of learning which have emerged at different 
evolutionary periods, with the more highly evolved organisms using 
several of them. It is quite probable that these different kinds of learn- 
ing follow different laws, and it is foolhardy to allow our desire for 
parsimony to cause us to overlook persisting differences, 

Learning capacity can be assigned a quantitative meaning only when 
l ng capacities are identified, and when we know more 


the several learnit i s 
1 these capacities are modified. To what extent 


about the ways in whicl vais ; 
can the ceiling of ability itself be modified by taining? It is evident that 
Without appropriate prior experiences there ate things to be learned that 
a man’s capacity which he will never acquire. Without 
c or mathematical tools, for example, there are 


not solve, regardless of how much native in- 


are quite within 
appropriate linguisti 
kinds of problems he can paiva 
telligence he has. After these tools are acquired, his learning ability rises, 


i infer £ ceili f his ability was raised. 
Without any necessary inference that the ceiling o y 


The obstacle to a precise scientific statement of the problem lies in 


3 Hull (1945), page 56. 


+ Maier (1939a, 1939b) and Tryon (1940) have stressed similar points. 
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Stimulus, Response, and Mediating Processes 


Stimulus and response concepts have a long history in psychology. 
The original S-R psychology was not that of learning theory at all: it be- 
gan with reaction time, as psychologists measured the time between the 
signal and the response to it, and then tried in various ways to fractionate 
this time. Even at the start, it was clear that there were intermediate 
processes beyond merely the speed of impulse transmission. In his early 
criticism of the reflex-arc theory, Dewey (1896) had in mind these re- 
action-time experiments and the generalizations that were being made 
from them. Woodworth, a student of the neuro-physiologist Sherrington, 
urged strongly that the S-R motto be hung on the walls of the psycho- 
logical laboratory as a reminder to the student to think in stimulus- 
response terms. Woodworth later modified his S-R motto to read S-O-R, 
recognizing that the mediating processes in the organism always had to 
be taken into account in the correlation between stimulus and response. 

It is meaningless to approve or disapprove of stimulus-response in 
general, because all psychology can be translated into stimulus-response 
terms. Before expressing approval or disapproval it is important to 
specify what kind of stimulus-response theory is under consideration, 

Two kinds of stimulus-response theory are worth distinguishing. 
One of these we shall call a substantive S-R theory, the other a func- 
tional S-R theory. 

By a substantive S-R theory we shall mean the kind of theory that 
says that the only facts of interest to a psychologist are facts of physical 


stimulation and measurable responses (muscular movements and glandu- 


lar secretions). Not only is the theory tied to S’s and R’s, but all inferences 
are to S’s and R's, including inferences to mediating processes. For this 
reason, such theories are also called peripheralist. Hull's fractional antici- 
patory goal response (rg) is a good representative of a substantive 
intermediary. 

A functional S-R theory is anchored to stimuli and responses as points 
of reference, in giving operational definition to the measurements used in 
the theory. That is, the theory always points to S’s and R’s in describing 
its experimental results or in beginning its theory construction. Mediat- 
ing processes, however, need not be conceived in S-R terms, and they 
may have purely functional characteristics. The drive level (D) in the 
theories of Hull and Spence is a concept of this kind. It is given opera- 
tional definition in physical terms, according to maintenance schedule 


and hours of food deprivation, but D itself is characterized by a mere 
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number, a multiplying constant without any dimensions of its own, unless 
we infer it to have the dimensions of reaction potential. Thus a system 
such as Hull’s or Spence’s may be substantive in some parts and func- 
tional in others. 

The functional S-R theory is generally congenial to the functionalist 
psychologists, whereas the substantive S-R theory is more common among 
neo-behaviorists. Thus the stimuli used in a memorization experiment may 
be classified by the functionalist according to degree of meaningfulness. 
Because degree of meaning can be specified denotatively by reference to 
the material used in the experiment, the difficulty of defining the 
material in strictly physical stimulus energy terms does not worry him, 
although it should worry the substantive S-R theorist. The functional 
S-R position permits further specification of the mediating processes, 
but these need not be specified in S-R terms. Thus the mediators can be 
brain states, if evidence can be found for them.’ 

MacCorquodale and Meehl (1953) have shown that an expectancy 
theory can be written in functional S-R terms, so that the main difference 
between their theory and one such as Hull's or Spence’s is the difference 
that I have indicated between substantive S-R theories and functional 


S-R theories. 

If this analysis e t 
largely over the nature of and specification of the mediating processes. 
The factual results are going to be accepted in the end by any theory 
that survives, so that there is no point in looking for a crucial experiment 
at the level of learned behavior. It is only experiments that bear crucially 
on the mediating processes that will make possible a choice between 
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Perception and Learning 
The problem of perceptual discrimination is a central one for learning 
theory There is some uncertainty as to the manner in which perception is 
natively organized, but at a primitive level there are undoubtedly some 
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discriminations of continuity and discontinuity, of change and succession, 
of boundary phenomena, of proximity and remoteness. There is an old 
controversy between those who believe that the organization of perception 
is essentially unlearned, and those who believe that practically everything 
within perception is learned.* It is not necessary to enter this controversy 
in order to point out that the learning theorist must be aware of the learn- 
ing within perception as well as of the role of perception in learning. 

The perception of objects suffices to show that there is at least some 
learning in perception. A cake of ice looks cold; the red tip of the poker 
looks hot. The properties of cold and hot, not present to the senses, are 
present in the perception as a result of prior experiences. They are not 
judgments or deliberate inferences from perceptual data, but are given 
as immediately as any other of the properties of the perceived objects. 

The process whereby identifying signs are used to represent a total 
situation previously experienced in called redintegration.® The ice looks 
cold because the previous experiences of felt cold are redintegrated by 
its appearance. The paramnesic experience (sometimes referred to as 
déja vu) is probably of this sort. That the process occurs is evident, but its 
explanation is not easy. 

If we accept the fact of perceptual learning, and we believe in some 
sort of goal orientation in learning, can we discover any goals in per- 
ception that might control perceptual learning? I have elsewhere sug- 
gested two possible perceptual goals: (1) the achievement of environ- 
mental stability, and (2) the achievement of clarity and definiteness in 
perception.'° 

The organism seeks a perceptually stable environment in a fashion 
somewhat parallel to that in which it seeks an internally st: 
ment. There is a kind of environmental homeost 
logical homeostasis, The equilibrium is a dynami 
environment, like the internal one, is ever ch: 
ates perceptual differences between night 
logical differences between waking and sle 
an environment that distorts too rapidly. If a man’s perceived environ- 
ment distorts too rapidly he gets upset or seasick. 

In normal perception the goal of stability accounts for many of our 
perceptual achievements. For example, were it not for this achieved 
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stability, the visual world would move as we move our heads from side 
to side. That such stability is an achievement is easily demonstrated by 
seeing what happens when you view the world through reversing lenses. 
When the visual world is unfamiliar, as it is through reversing lenses, 
the line of regard is the anchoring point, and when you move your head 
the world races by in the direction opposite to the movement of your line 
of regard, Stratton (1896) (1897) found that after wearing the lenses for a 
while the visual scene no longer would swing with head movements: sta- 
bility had again been achieved.” 

The goals of stability and of definiteness are similar, but they can 
be distinguished. In reversible geometrical figures, for example, defi- 
niteness is achieved at the price of stability. Stability might be better 
achieved if a geometrical figure were perceived as only so many lines 
on the paper. Instead we see it as a something, even though it is am- 
biguous, and so the “somethings” it represents tend to alternate. 

Woodworth felt convinced that there is a fundamental motive to 
Perceive clearly. As he put it: “To see, to hear—to see clearly, to hear 
distinctly—to make out what it is one is seeing or hearing—moment by 
moment, such concrete, immediate motives dominate the life of relation 
with the environment.” * ; ; 

Recent emphasis upon “curiosity” as a motive for learning suggests 
that such “curiosity” may also be a motive within the learning of per- 


Ceptual responses themselves.** ; 
There is a strong tendency to construct concrete things out of the 
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the learned features of perception, in line with other studies of motivated 
learning.'” 

The relationship between perception and learning is a reciprocal 
one. So far we have considered the role of learning in perception. Now 
we may consider the role of perception in learning. The cognitive 
theories all assign an important place to perception and perception-like 
processes in learning; some even have gone to the extreme of identify- 
ing learning with perceptual restructuring. Currently S-R psychologists 
are paying more attention to perception in learning. Spence (1955) 
includes “looking at” the food in the goal-box as a feature of reinforce- 
ment. Lawrence (1949) (1950) emphasizes acquired distinctiveness of 
cues as a part of discrimination learning. Wickens (1954) decides that 
the stimuli in learning experiments arouse perceptual responses that are 
the intermediaries to which overt movement responses are made. 

A complete learning theory must incorporate perceptual processes 
as part of the main body of theory. Secondary reinforcement, increasingly 
prominent in S-R theories, is mediated largely by something akin to per- 
ception. These relationships cannot be permitted to remain unexamined 
by the theorist. 


Motivation 


In Chapter 12 we had occasion to review some of the studies con- 
cerned with the details of drive and reinforcement, and with the current 
criticisms of the need-drive-incentive conception of motivation, An im- 
portant contemporary trend is to supplement the tension-reduction theory 
(escape from food tensions, escape from pain or fear or anxiety ) with more 
positive incentives, such as exploration and curiosity, or the arousal of 
positive (pleasurable) affect. 

That there are goal tensions, and that the goal situation is an end- 
state that makes a change )( either toward relaxation or clation) cannot 
be denied. In that sense, everyone must accept a reinforcement princi- 
ple, meaning nothing more by it than that rewards and punishments mat- 
ter for learning. This has come to be called the empirical law of effect. 
Its incorporation into quantitative theories may remain at this level, 
but its further theoretical explanation raises a number of additional 
questions. 


16 The many recent studies of perceptual distortion by way of needs and persistent 
personality characteristics are also relevant even though somewhat controversial. See 
summaries by Bruner (1951), Klein (1954), Postman (1953). Against a learning 
theory of perceptual change, see Gibson and Gibson (1955). á § 
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The necessary distinction between performance and learning, be- 
tween facilitation and association, as Seward (1952a) puts it, does not 
require that motivation be assigned to one of the terms and not to the 
other. Everybody agrees that motivation affects performance, but there 
are arguments over the extent to which it affects the learning of habits 
or of cognitive structures. The problem is an empirical one, but the 
goals of the learner almost surely affect both learning and performance. 

(Motivation determines attention, and attention is related to cognitive 
restructuring, the form of learning that is supposed to be most detached 
from performance. ‘The effect of motivation upon learning and perform- 

ance is not symmetrical. A starved person may be motivated to eat but not 
to learn new table manners. Table manners are probably better learned 
when the hunger drive is less strongly aroused. ) 

The introduction of personality variables into conceptions of drive 
(eg, anxiety or need for achievement as measured by tests) suggests 
an important aspect of motivati 
the relation of the given circumstances to the more persistent goals of the 
individual. If tiere is any teaching that has come to be generally accepted 
as a result of Freudian psychology (and related conceptions of person- 
ality), it is that motives are organized in some sort of hierarchy within 
the individual, resulting in a value-s) stem expressed in behavior. This 
system may go by such names as “ego or character-structure,” but 
5 becomes very important for the learning of that 


on long neglected in learning theories: 
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fixed alternatives; in the maze and multiple-discrimination experiments, 
successive choices must be integrated in proper order. 

Two main theoretical problems arise: the problem of the nature 
of the original adjustment ending in the correct act, and the problem 
as to how this adjustment is facilitated when the situation is repeated. 

Thorndike’s explanation has tended to separate the two problems. 
The initial adjustment is a result of the tendency of the organism to vary 
its responses (“law of multiple response”). The acts within its repertory 
run off more or less by chance until the correct one is hit upon. Then 
the second principle comes into play. The correct response is stamped in 
(“law of effect”). When the situation recurs, the rewarded response has 
a favored position in relation to the rest of the possible multiple responses, 
and tends to occur earlier than before. Because prior experiences will 
have established preferential orders of response even in a situation rela- 
tively novel, the multiple responses will never, in fact, be purely random. 
Hull has worked out a theory like Thorndike’s, giving an orderly account 
of these possibilities, 

The alternative is that the original behavior is not the running off of 
earlier habits in the new situation, but is a genuine attempt at discovering 
the route to the goal. Past experience is used, but in a manner appro- 
priate to the present. Such an interpretation makes the original adjust- 
ment a provisional try, to be confirmed or denied by its success or failure, 
What is here being called a provisional try corresponds to what Tolman 
and Krech have called “hypothesis” behavior. The theory supposes that 
a provisional behavior route is kept in suspension until its consequences 
change its provisional status; if it is confirmed it is an appropriate path 
of action to be followed under like circumstances, 

When the provisional try was first suggested as appropriate for 
much behavior classified as trial and error,'* the concept of feedback 
mechanisms had not yet gained wide currency. All that the provision 
try implies is a feedback that corrects the provisional responses 
to their consequences. The conception of provisional try, and of 
suggests that the role of reinforcement is essentially informat 
or less what Thorndike implied when he talked of the con 
“O.K.” response. 
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There are some consequences for quantitative theories if the informa- 


tive role of reinforcement is accepted. The relationship between the 


informative role of a stimulus and its facilitating role is not one 
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softly as well as when they are shouted (though there are some differ- 
ences, as every army recruit and army sergeant knows! ). Hull theorized 
that habit strength requires some (unspecified) minimum of drive- 
reducing reinforcement, but is not dependent on the amount of drive 
reduction beyond the necessary minimum. Hull's position can be as- 
similated to an information theory. The minimum reinforcement neces- 
sary for habit strength is that required for information, for feedback; 
excesses bevond that affect reaction potential but not habit. 

The conception of the provisional try is especially appropriate when 
the movement called for is not clearly specified by the apparatus. In 
mirror drawing or pursuit learning or ball tossing the performance called 
for is a pattern of movements correct in direction and (in some cases) in 
rhythm. The general character of the correct movement is understood at 
the outset, but it cannot be produced without considerable fumbling. 
This fumbling is a real try. The learner does not act in chance fashion, 
no matter how clumsy his movements may be. In this kind of situation 
either Dodge's designation approximation and correction or Woodworth’s 
trial-and-check is appropriate. Theories of spontaneous movement and 
of conditioning of previous responses have never been successful in deal- 
ing with the detailed movements that occur in manual skill. The sug- 
Sestion that any new movements must be compromises between move- 
ments elicited as old responses to familiar stimuli does not take the 
Problem seriously. The movements are new integrations at first crudely 
adapted to the situation at hand, presently becoming better adapted. 
organism at some early stage in its development 
has to learn how to try,” but by the time it is ready for the usual labora- 
tory experiments it has learned, and its behavior is probably not greatly 
unlike the human learner acquiring a skill. The cat’s behavior in the 
problem-box is best described by saying that the cat is trying to get 
out of the box. When the cat quits trying, and, as Lewin would say, 
“goes out of the field,” its not-trying is evident also. It may lie down, 
lick itself, appear to be relaxed, then suddenly return to a state of tension 
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or an irregular path) require for their explanation: (1) the goal of re- 
maining on the target, and (2) the perception of discrepancies from 
the target, resulting in corrective movements. This is provisional-try 
behavior, corrected by feedback from the consequence of the attempted 
positional movements. 

The “tracking” behavior of a hunting dog gives a naturalistic illustra- 
tion of similar behavior. Woodworth, for example, long ago used the 
behavior of the dog as an illustration of “seeking” behavior: 


That there is a persistent inner tendency towards the consummatory reac- 
tion is seen when, for instance, a hunting dog loses the trail; if he were simply 
carried along from one detail of the hunting process to another by a succession 
of stimuli calling out simple reflexes, he would cease hunting as soon as the trail 
ceased, or follow it back again; whereas what he docs is to explore about, seeking 
the trail, as we say. This seeking, not being evoked by any external stimulus (but 
rather by the absence of an external stimulus), must be driven by some internal 
force; and the circumstances make it clear that the inner drive is directed toward 
the capture of the prey.?? 

This behavior, controlled by some sort of “fit” between the perceived 
environment and the goal-seeking behavior of the organism, involves 
the assimilation of feedback information along the route as well as at 
the end. It descriptively justifies a concept such as that of the provisional 
try. The incorporation of this behavior into a more general theory re- 
quires further steps that have not as yet been taken. 


Overlearning and Stereotypy 


A theory which assumes that behavior is regulated reasonably 
on the basis of available information is put to it to account for the per- 
sistence of habits in situations in which they are no longer adaptive, 

One kind of stereotypy, normally adaptive, is a consequence of 
overlearning. By overlearning is meant the facilitation of response with 
repetition after the essential learning has been mastered. Memorization 
furnishes convenient illustrations, A series of digits can be learned in a 
single presentation so that it can be correctly repeated if recall is im- 
mediate. But if the series is long enough to be near the limit of immediate 
memory, more repetitions are needed to fix the list so that it can be re- 
called after greater lapse of time. The most evident effect of overlearn- 
ing is this one upon recall. It provides the explanation for the long 
retention of overlearned skills such as swimming or bicycle riding. 

But overlearning has other consequences than guar. 
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available. One possible result is a mechanization of habit, leading to a 
loss of “docility.” Oft-repeated activities take on some measure of goal- 
character, perhaps through a process like secondary reinforcement. The 
learner is then “blinded” to new possibilities.“ Rats which have learned 
a path including a jump appear to prefer that path to other more eco- 
nomical ones after thev have been overtrained on it.“ Maier and his as- 
sociates interpret some of their data as showing that habits learned in 
accordance with original preferences are more easily altered than habits 
built up against original preference If this be generally true, it may 
account in part for the persistent preferences by adults for highly arti- 
ficial ways of doing things learned as children.** 

i of doing things is commonly economical, and there 
is no special mystery about its persistence. It has had motivational sup- 
port according to any theory. There are other instances of stereotyped 
. which have had no such history of support as adap- 
additional problems. 


The familiar way 


fixations, however, 
tive habits, Their occurrence raises 

One source of fixated behavior is the insoluble problem. There is no 
Way in which to make a reasonable adjustment, because the order of events 
in the environment is chaotic. There is no basis for choice, and a ritual- 
istic fixed choice is adopted. Such a ritualistic performance represents be 
giving up of the attempt to predict the ocgùrreneg of reward on the basis 
of “correct” behavior. The environment is unresponsive to the trying by 
the learner. It is not surprising that such a pattern of behavior persists 
4gainst punishment. Maier’s finding that there was evidence that positive 
and negative cards were in fact discriminated, even though the position 
habit femnained fixed, means in cognitive terms only that the organism 
finds the situation too difficult. The problems which experimenters set 
animals are often fairly complex when translated iito our own experi- 
ences, In this situation the animal learner, in adopting the fixated habit, 
has rejected the second window as no better than the first, since its 
Provisional tries were frustrated as frequently a one place as the other, 
Because there now develops a pattern of predictable success and failure 
at the first window does not mean that things would be any better at 
the other one. A human parallel might be as follows. Suppose that in 
long experience the front door and the rear door of the laboratory had 
always been unlocked together. Even though one knows about both 
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the solutions have to be discovered or invented through a reconstruction 
of experience appropriate to thg problem at hand. This is not completely 
specified by experiments on insight, but it is suggested by them. ac 

There are several alternatives. One is to treat all problem-solving 
and reasoning as a form of trial-and-error learning, with some transfer 
to account for short-cut solutions. This is the position of Carr (1925), 
endorsed by McGeoch (1942). This position fails to grant any reality 
to the reconstruction of experience present in the act of reasoning which 
makes the “try” something other than the running off of varying habits. 
A second alternative, that of Hull (1935c), is to recognize that the mecha- 
nism for assembling prior experiences appropriate to problem solution 
has in it characteristics different from ordinary trial-and-error learning. 
Hull attempts to derive such behavior from other processes. This deriva- 
tion depends upon discrimination among the fractional anticipatory re- 
sponses that are common to several experiences and have 
grated in order to solve the problem. 

Stimulus-response psychologists, under the restraint of Lloyd 
Morgan’s principle of parsimony, have assigned as little as possible to 
ideation, but they have usually recognized that some behavior is con- 
trolled by ideas. Thorndike called ideas by that name and granted their 
occurrence; Hunter recognized a “symbolic process” necessary to account 
for behavior in delayed-action and double-alternation experiments; * Hull 
inferred “pure-stimulus acts,” that is, responses whose sole function is 
to furnish stimuli to guide or integrate behavior sequences; Guthrie’s 
position is about like Hull’s, with great emphasis upon movements (es- 
pecially implicit speech) as the substance of thought. Tolman, in be- 
haviorist spirit, makes of ideation a “mere ‘behavior-feint’ at running- 
back-and-forth,” without specifying the precise nature of the feint, 

Is there any gain, having accepted ideation, in secking a movement 
basis for ideas? To assert that the organism can discriminate between 
the sensory consequences of two kinds of anticipatory salivation more 
readily than between two perceptual redintegrations is no great gain, 
except for the logical possibility of discovering by separate techniques 
the presence of two kinds of salivation, Because both perceptual redinte- 
grations and stimuli from anticipatory responses are, in fact, inferred, it 
is as possible to be logically strict about one as the other—and the evi- 
dence used in the inference for one has to be as good as 
the other. The benefit of the doubt need not go to th 
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movements. There is some concealing of ignorance in attributing specific 
stimulus-response bases for psychological functions such as drives, sets, im- 
ages, and thoughts. If we are critical about accepting the results of experi- 
ments, then stomach contractions are not the basis of appetite, eye move- 
ments are not the cause of the Miiller-Lyer illusion, kinesthetic cues are 
not the preferred ones in maze learning, tongue movements are not the 
basis for thinking. The burden of proof is on those who believe otherwise. 
That there are some movement concomitants of thought is insufficient 
evidence that these movements (and their consequences) are the suffi- 
cient condition for the occurrence of the thought. The argument that 
inferring movements will make easier the task of the neurophysiologist 
does not hold. It is no harder to make a brain model for perceptual red- 
integration than for the pattern of discrimination among the proprio- 
ceptive consequences of anticipatory responses. im 
The alternative to trial-and-error and reinforcement theories is to 
make allowance for a genuine reconstruction of experience appropriate 
to problem solution, as proposed by Tolman (1982b) in his principle of 
creative instability and by Maier (1931) in his separation of reasoning 
from other kinds of learning. An organism capable of functioning at 
different levels does not keep his abilities in abeyance to be brought out 
an experimenter. Therefore if a given ability 
is shown in clear outline in one experimental situation, it may be sup- 
Posed that if looked for it might also be found in other situations in 
Which it is not as clearly exhibited.*? Thus the reconstruction of experi- 
ence goes on to some extent in every act of redintegration, When ice is 
seen as cold, or a distant object as a motorcycle, O a not 
Photographic, nor are they simply revivals of gina os aa eae 
Perceptual achievements molded out of a re nie ca l oe ak 
Sensory discriminations. It is only the more suulang recon nations a 
and present perceptions that we call reasoning. 
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already exists is one way of solving such a problem. It is iis sud-error 
behavior, but in a direction against the attraction of the goal. Another 
solution is that of discovering or creating some sort of tool which then 
becomes a means to the goal. This often requires a reconstruction of 
experience, perhaps by putting some familiar object to unfamiliar use: 
making a pliers into a pendulum-bob, a clamp into a hat-hanger, a 
Bunsen-burner tube into a blow-pipe, and so on. It is this reconstruction 
aspect rather than the detour aspect which is most significant in reason- 
ing experiments. 

The reconstruction of experience follows certain lines of solution 
which Maier calls “direction.” If there is a flame to be blown out, one 
“direction” is to find some way to direct a jet of air upon it, another 
direction is to use water, still another is to smother it with something 
solid. Each of these suggestions directs the search for appropriate objects 
to convert into means. In one of the psychological laboratories during 
World War II a glass plate was being used in a piece of apparatus in 
such a way that it was to be looked through and also used as a mirror, 
somewhat as in the Dodge tachistoscope. There were troublesome double 
images of a small fixation object due to reflections from both the front 
and the back surfaces of the glass. The first hypothesis was that the glass 
could be made so thin that the images would fuse. Successively thinner 
sheets of glass were used in following out this “direction,” but without 
solving the problem. The idea emerged of making the glass much thicker, 
so that the images could be drawn apart and one of them screened off. 
This worked. The direction “if we can m 
hypothesis used until the other hypothes 
enough” supervened,** 
perience of insight. 


ake it thin enough” was one 
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Understanding and inference may be used in unsuccessful solutions 
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a restructuring, a seeing of things in a new light, appropriate to the prob- 
lem at hand. 

Because the integration and reconstruction of experience go on in 
many acts of learning, reasoning need not be made a separate category 
as done by Maier. This does not mean that there may not be one or more 
special capacities involved in reasoning ability and in other sorts of 
creativity. But there may be different learning processes below the rea- 
soning level as well. In ‘order to avoid artificial boundaries, it is prefer- 
able to include together in one family of processes all forms of problem- 
solving and reasoning, subject to later determination of the kinds and 


number of processes involved. 


To What Kind of Theory Would the Foregoing 

Comments Lead? 

The comments on the several kinds of problems facing learning 
theorists do not in themselves constitute a theory of learning, yet they 
express certain desiderata for a theory. If they were to be co-ordinated 
with any one of the preceding viewpoints, they would fit best with the 
tolerant functionalism recommended by Woodworth. 

The merit of the broad functionalist position is the willingness to 
accept new dimensions as they are discovered, to tolerate some theoreti- 
cal inconsistencies rather than fly in the face of well-demonstrated experi- 
Mental relationships. Such a positivistic attitude has worked well in other 
Sciences, Even the elegant science of physics has stumbled along for a 
good while with inconsistent theories such as the wave theory and the 
Biologists do not worry too much that their gene 


quantum theory of light. 
s of embryological development are not yet satis- 


theories and their theorie 


factorily į 
tegrated. r ; : 
nin aisles with functionalism is that it does not satisfy the esthetic 


esire for a neat system. The motivation to seek for a few postulates, to 
find interchangeable constants, to be able to move logically from one 
Part of the system to another has worked well in the physical sciences, 
and there is every reason to expect it to work well in psychology also. 
Functionalism has no quarrel with this possibility, and it is entirely pos- 
sible to seek systematic elegance within the tolerance of a functionalist 
Position. In the end, we shall know more and find more convenient ways 
both of ex pressing what we know and of finding out still more. That is 
ultimately ahat wucarenie however that end is achieved. 
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AN EXPERIMENTAL PROGRAM 


In the foregoing suggestions toward a theory of learning the errone- 
us impression may easily be given that the concepts represent a return 
to the pre-experimental period of psychologizing about learning. Accept- 
ance of such a point of view might be interpreted to mean turning our 
backs upon the advances which have come through rejecting mentualistic 
and mystical concepts in favor of objective behavioral ones. In order to 
allay such fears and correct misapprehensions, I wish to propose what 
appear to me to be some directions in which contemporary research upon 
learning might move. 


Naturalistic Observations 


Because the search must continue for the most appropriate concepts 
to be used in the study and description of learning, it is desirable to keep 
the approaches flexible and to welcome novelty as well as precision. In 
urging consideration for a concept like the provisional try, I have not 
intended to discard the results of experimentation which have been ob- 
tained under other concepts, but only to break through the rigidity which 
makes us think of nonsense syllable lists, mazes, conditioning experiments, 
and target practice as exhausting the phenomena of learning. 

The common acknowledgment of the ingenuity and originality of 
the experiments by Lewin and his students (even by those most devas- 
tatingly critical of his conceptual system) leads to a search for the roots 
of this freshness of approach. The answer probably lies in Lewin’s tend- 
ency to pick problems out of real life situations, instead of choosing 
examples which would fit a previously learned method of study. It is out 
of real life one finds problems such as the forgetting of an intention, the 
tendency to resume interrupted tasks, and so on. Such problems are not 
thought up by students whose only concern is to counterbalance practice 
effects in a standard experiment in such a manner as to add an additional 
decimal point to the determination, 

Somewhat the same thing might be said about Freud’s contribu- 
tions. Because Freud was listening to real people in their efforts to recall 
and to come to grips with their recollections, he discovered f 
repression and distortion in memory which were unknown 
lowers of Ebbinghaus. 

It would probably be worth while for someon 
what Brunswik has been doing for perception and 


acts about 
to the fol- 
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velopment **—that is, take a notebook and follow a child around for 
hours at a time. What are the circumstances under which he learns? 
What performances provide for recall? for recognition? for problem- 
solving? What tasks are left half-finished? What kinds of forgetting are 
there? What motives can be inferred? Such observations would be like 
a breath of fresh air in most learning laboratories. To be sure, many 
observers would see only what they were taught to sce. Some would find 
nothing but trial and error, others would see the child doing only what 
he last did, others would be impressed by the child’s insight. But of a 
group who started out in this manner one or two would probably come 
back with suggestions as fertile as those of Lewin and his students. Such 
suggestions would help break the sterility of contemporary learning 
theories, and would help in the search for appropriate new concepts. 


Physiological Changes Concomitant with Learning 
No one should infer because this book has largely neglected physi- 
anti-physiological. On the contrary, it is a blot upon our 


ology 3$ that it is 
that after so many years of search we know as little 


Scientific ingenuity À i 
as we do about the physiological accompaniments of learning. 

Learning theories may be developed at a molar level, independent 
of neurophysiology. It is preferable that the division of labor between 
Psychology and physiology be recognized, and that psychological theories 
be developed objectively and self-consistently at the molar level, with- 
out recourse to a hypothetical neurology or a physiology not grounded 
in demonstrated relationships. Once, however, the possibility of such a 
ed—and there appears to be increasing agreement 
on this point—then an appropriate position with respect to neurophyši 
ology can be established. The learning theorist ee niot have to wait 
or physiology in order to go ahead on firm pea fic ground but this 
does not mean that he will not welcome any advance in knowledge in 
this ficld, nor that he will not wish to contribute to such knowledge 
thro E xperiments. 

a ai ae needs is for a crucial seg a mie 
Specifically a change occurring in neural tissues pra pira a EA ds 
related to such tissue) as learning takes place. ‘ossibly the mst task is 
to find an organism as appropriate for this purpose as the fruit-fly has 
been for genetics, The white rat is probably too complicated. Perhaps a 


learning theory is grant 


and Wright (1954). 


“4 Brunswik (1944). Barker N 
2 Except for the brief discussion in Chapter 13. 


482 THEORIES OF LEARNING 


simpler organism like amblystoma should be chosen, in which the nervous 
elements are few enough to be under direct inspection with modern 
techniques. It is possible to teach larval amblystoma simple habits, 
and it ought to be possible to find out what sort of “trace” these habits 
leave. To refuse to search for such a change would be like refusing to 
look for the micro-organism causing malaria just because an objective 
plot of the temperature fluctuations in the patient could be made a means 
of identifying the disease. It can be stated with reasonable confidence 
that there are changes in the nervous system accompanying learning, 
and, if there are, then there are surely no insurmountable obstacles to 
detecting such changes. It is a surprising thing that, with all the interest 
in learning, this direct identifying step has not been taken more success- 
fully. 

The other possibilities, besides the study of the simpler organism, 
include the study of special preparations, such as the spinal animal. Elec- 
trical methods of exploring the cortex may succeed, but they, like be- 
havioral studies, are likely to be at a molar level from which the 
identifying steps still have to be made inferentially. 

Much that is known and can be found out about learning will not be 
altered by changes in neurophysiological knowledge. This is the basis of 
the argument for learning theories which dispense with physiology, and 
it is sound. But it would be foolhardy to suppose that a knowledge of 
the physical substratum of learning would not be helpful in further ad- 
vancing our theories and in achieving control over the learning process. 
Such knowledge is indeed so important that psychologists interested in 
learning will wish to continue in the future, as they have in the past, to 
share with physiologists and neurologists in experimentation designed to 
get at the changes in the organism which go along with manifest behavior. 


Standardization of Methods and Measures 


Scientific facts ought to be gathered in such a way that later workers 
can move ahead on the secure foundation of fact accumulated by earlier 
workers. In more advanced sciences, such as chemistry and physics, large 


handbooks are devoted to tables and formulas conserving this accepted 
body of knowledge. 


The handbooks of psycholo: 
as a storehouse of important e 
relatively few tables of measur 
starting points for later work. 


8y are not of this kind. While they serve 
xperimentation and theory, they contain 
ements or constants that can be used as 


Suppose, for example, a student comes to 
#8 Moore and Welch (1940), 
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one of us and asks: “Will you please measure my reaction time, and tell 
me how I compare with others?” Of course we know how to connect a 
telegraph key in circuit with a sounder and an electric timer. But do we 
know how to secure results that can be compared with norms? How many 
practice trials, or perhaps sessions of so many trials each, should there be 
before we begin to count? How many measurements do we need for a 
stable value? A ready signal? The interval between ready signal and 
stimulus? If an auditory stimulus, how loud? A useful start was made by 
those who assembled what relevant material they could find in a Hand- 
book of human engineering data,“ but the very limitations in what they 
could find helped to show how inadequate most of our standardization 
is. The two studies of simple auditory reaction time that they report 
yielded means of 192 msec. and 153 msec., the second mean being a 
full standard deviation below the first. Of course the conditions differed, 
but that is why we would have difficulty in measuring our student’s re- 
action time and knowing whether it was long or short, unless we troubled 
to run our own normative group. : 

If we are to make advances in quantitative theories of learning we 


need data that are comparable, and this means more standardization than 


has yet been achieved. as 

Two arguments are used against standardization: 

1, Standardization may prove wasteful until our conceptual systems 
have settled to the point where we know what we want to measure. If 
We proceed to standardize now we may spend a great deal of effort on 
the trivial, or on some device or method tied to outworn concepts. For 
example, we might finally achieve a calibrated set of nonsense syllables, 
Just to find that those working on rote memorization prefer to use mean- 
ingful words in their experiments. 

2. Standardization, because it demands conformity, might put a 
damper on originality, and hence achieve comparability at too great a 
Cost, The “irain ed observer” of early introspective psychology zan this 
danger, and some suspected that students observed the same things as 
their masters simply as a result of this “standardization” of what was 


Permissible in a report. 


Both of these arguments are somewhat misleading. Consider first 


Whether or not standardization is premature. Nobody would Attempt 
large-scale standardization of equipment and methods except in fields 


in which there is widespread current activity. If the fields later became 


v ardization was a mistake. 
inactive, that would not prove that standardizati e. It 
(1952). 
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conceivably might have helped to settle some issues that had previously 
remained open. Furthermore, without standardization as much effort 
might have been spent in the meantime on unstandardized procedures 
with less definitive results. 

The second argument, that standardization cripples novelty and 
imagination, does not necessarily hold. Suppose that one wishes to in- 
troduce a new-style memory drum, or to introduce a new material to be 
learned, after standardization has been achieved with an old-style drum, 
with other material. All that is necessary is to perform calibration experi- 
ments to show that the two drums give comparable results (if they do) 
with the original material. Then the new drum can be recommended if it 
is more convenient. The new materials then need to be standardized also, 
against the norms of the previously standardized material. Thus the new 
equipment and materials, if they merit it, can be added to the store of 
standardized materials, and normative data can be published where such 
data are accessible.” 

Without standardization, failure to repeat earlier results commonly 
leads to the excuse that conditions may have differed. This excuse needs 
to be eliminated, so that we can repeat experimental data when we try 
to. Then we can go on to a new experiment, with specified modifications, 
and criticize the interpretation of the earlier data. Standardization thus 
need not prevent originality. The gain is that results under novel condi- 
tions can be compared confidently with results under standard condi- 


tions, so that the new results can fit into the historically accumulating 
body of scientific knowledge. 


Broadening the Boundary Conditions 


For some workers, the process of standardization, because it takes 
time and requires controls, is likely to result in a narrowing of the initial 
and boundary conditions of some varieties of experiment through specify- 
ing pretraining, apparatus dimensions, and the like. But again we must 
recognize a division of labor, so that the experimenter does not have to 
be coerced into doing what he does not find congenial. For 
menters, the most inviting task is to broaden the bound 
both experiment and theory. This is, of course, a re 
naturalistic observations. But these observations will 


some experi- 
ary conditions of 
ason for encouraging 
not serve well unless 
they are translated into experimentable form, For example, even now new 


#8 The parallel with intelligence test standardization and revision is obvious. Such 
tests, because their norms are taken seriously, have been better “calibrated” 


than much 
laboratory apparatus. 
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tichness is being introduced into learning experiments by including mo- 
tives formerly little studied in the laboratory. So many human motives 
are personal—emulative, defiant, aggressive, appealing—that a social 
psychology of learning is sure to develop eventually.** 


RELATION BETWEEN LEARNING AND THE PRACTICAL 
DEMANDS OF INSTRUCTION AND TRAINING 


Theories serve more than one purpose: they attempt to organize 
pt to provide guiding threads or hypothe- 


existing knowledge, they attem 
also furnish principles by 


ses toward new knowledge, and they may 
an be used.” This practical outcome is seldom cen- 
constructor of theory, and it is not surprising, 
from the learning theorist often 
Newman, writing on the con- 


Which what is known ¢ 
tral in the thinking of the 
therefore, that the person seeking advice 
comes away disappointed. For example, 
temporary status of learning theory, says 
i d in batting a home run, fencing, boxing, fancy div- 
tis worth as much as the psychologist’s 


that for many practical skills 
such as those require 
ing, or dancing, “a rabbit's foot i 
sage advice.” # While Newman's ske 
Puts us on notice not to expect from the learning theorist too much that 


1S practical. 


pticism here is extreme, he at least 


Some Agrecment on Practical Matters 

any of the quarrels of the theorists are 
r n to immediate practical prob- 
tically important experimental 


It turns out, however, that m 
important in relatio 


a great many prac 
the theorists are in substantial agreement. 
That is, they accept the facts as demonstrated, even though they dis- 
agree about the interpretations of these facts. If the theoretical differ- 
ences are irreconcilable, and one position eventually wins out over the 
other, there will ultimately be an ¢ i 
tical people today need not wait for the resolution of these theoretical 
Controversies, 

Here are a few statem 
of learning theorists to agree- It wou 


Agreement, for some statements require m Sins 
always a few theorists who are sticklers for wording. 


iin ones, not very 
ems; there are, in fact, 
relationships upon which 


fect upon practice. But advice for prac- 


ents upon which I would expect a majority 
Id be too much to ask for perfect 
any qualifications, and there are 


89 For 2 inning, see Rotter (1954). 5 
* Much Serious beginni P apted from Hilgard (1958b). 
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1. In deciding who should learn what, the capacities of the learner 
are very important. Brighter people can learn things less bright ones 
cannot learn; in general, older children can learn more readily than 
younger ones; the decline of ability with age, in the adult years, depends 
upon what it is that is being learned. 

2. A motivated learner acquires what he learns more readily than 
one who is not motivated.*? The relevant motives include both general 
and specific ones, for example, desire to learn, need for achievement 
(general), desire for a certain reward or to avoid a threatened punish- 
ment (specific). 

3. Motivation that is too intense (especially pain, fear, anxiety) may 
be accompanied by distracting emotional states, so that excessive moti- 
vation may be less effective than moderate motivation for learning some 
kinds of tasks, especially those involving difficult discriminations, 

4. Learning under the control of reward is usually preferable to 
learning under the control of punishment. Correspondingly, learning 
motivated by success is preferable to learning motivated by failure. Even 
though the theoretical issue is still unresolved, the practical outcome must 
take into account the social by-products, which tend to be more f 
able under reward than under punishment, 

5. Learning under intrinsic motivation is preferable to learning un- 
der extrinsic motivation. 

6. Tolerance for failure is best taught through providing 
of success that compensates for experienced failure. 

7. Individuals need practice in setting realistic goals for themselves, 
goals neither so low as to elicit little effort nor so high as to foreordain 


to failure. Realistic goal-setting leads to more satisfactory improvement 
than unrealistic goal-setting. 


8. The person 
to authority, 


teacher, 


9. Active Participation by a le 
tion when learning, for example, fr 

10. Meaningful materials 
readily than nonsense materi 
stood by the learner, 


avor- 


a backlog 


al history of the individual, for example, his reaction 
may hamper or enhance his ability to learn from a given 


arner is preferable to passive recep- 
om a lecture or a motion picture, 

and meaningful tasks are learned more 
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11. There is no substitute for repetitive practice in the overlearning 
of skills (for instance, the performance of a concert pianist), or in the 
memorization of unrelated facts that have to be automatized. 

12. Information about the nature of a good performance, knowledge 
of his own mistakes, and knowledge of successful results, aid learning. 

13. Transfer to new tasks will be better if, in learning, the learner 
can discover relationships for himself, and if he has experience during 
learning of applying the principles within a variety of tasks. 

14. Spaced or distributed recalls are advantageous in fixing material 
that is to be long retained. 

These points are neither systematic nor comprehensive. They are 
listed merely in order to add concreteness to the suggestion that there 
are a number of useful generalizations about which students of learning 


are in substantial agreement. 


Improving the Contributions of Psychology to the Practical 
Understanding and Control of Learning 

eater help in gearing itself to problems of 
ms of learning, there are a few 


If psychology is to be of gr 
taining and to the other practical proble 
Suggestions that can be offered. l 

l. Validate the principles developed in the laboratory in real life 
situations. A theoretical principle cannot simply be borrowed and used 
to guide practice. It must be first tried out under the conditions of actual 
life. In real life, learning occurs in a social setting, with constraints of 
time and personnel and equipment. An instructor teaching a course, be- 
cause he does not have complete charge of his students, is not free to 
t everything he would like to do. Hence a laboratory learning principle 

as to þe tried out in the classroom before we can be sure that it is useful. 


2. Be tolerant of negative results. If two methods of teaching or 


training are to be tried out experimentally, we ought to accept for trial 


only methods that have a good deal to commend them, for example, 
Skilled teachers who favor them. Under these circumstances we are very 
likely to come out with differences that are not significant statistically, 
Sven though the experiment is well designed and je ee 
Ostead of being disappointed with such a finding, an cae ng it, the 
result is well worth knowing. Within the practical limitations of classroom 
teaching it is quite possible that alternative methods are equally ac- 
Ceptable, and this may prove Very comforting to teachers who prefer one 


Or the other method. 


8. Enlist the co-operation of inventive practical people. Theories in 
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the psychology of learning do not automatically design or invent a 
ing methods. Very often experienced teachers have a flair for finding 
ways to motivate students, to gain their participation, to suit learning 
tasks to their ability levels. The psychologist seeking to give practical 
help will seek to collaborate with such persons, using his own methods, 
to be sure, to validate their procedures, but capitalizing on their in- 
ventiveness and experience in a situation with many features not present 
in his laboratory. 

4. Lean more heavily on studies of problem-solving and creativity 
than on studies of rote learning. The old “principles of economy” in 
memorization and in acquiring skill depended very largely on studies of 
rote learning, and hence emphasized massed and distributed practice, 
learning by parts or by wholes, consequences of warm-up and fatigue, 
the effects of recitation, and so on, While these principles have some 
relevance for learning, they are not very applicable in practice. The 
boy learning to ride his bicycle does not need advice on distributing prac- 
tice. If he assembles a motor car, he doesn’t need to be taught to space 
his reviews in order to remember the name of the carburetor or generator. 

OF far more practical importance is the ability to make relevant use 
of past experience in facing new problems, of maintaining motivation un- 
til a difficult task is completed or a baffling problem solved, of gaining 
confidence in oneself as a creative person, of learning how to diagnose 
a problem, how to fill gaps in necessary knowledge, how to call on ex- 
perts when necessary, In a rapidly changing culture such as ours in 
America, learning how to go about learning in the face of new demands 
is more important than learning to commit seri 


presented problems, 

A closer integration of theor 
hoped that an incre 
tractive. 


y with practice is desirable, and it is 
asing number of psychologists will find this task at- 


Pure versus Applied Research 
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nutrition. As theoretical and experimental science mature, the relation- 
ship between science and invention becomes more intimate. Gunpowder 
could be invented before physical science was far advanced, but the 
atomic bomb could not have been made without calling upon the re- 
sources of pure science. The scientific study of learning is today in an 
intermediate position. While its results can be helpful to educators and 
parents, there are many problems in the practical guidance of learning 
Which cannot yet be handled upon the basis of well-established principles. 

At any one time, scientists in experimental and theoretical fields 
can work only on the problems upon which they are prepared to work 
with the conceptual and material tools at hand. These may or may not 
be appropriate to the practical problem calling for solution. Mothers must 
feed and rear their children as best they can while the children are grow- 
ing up, and physicians have to treat their patients when they are ill. 
Practical pursuits cannot wait. Hence some sorts of adjustment are made 
in the practical world, while the scientists go about their business. Chil- 
dren may have rickets because at the time nothing is known about 
vitamins, and patients may die because a new curative drug has not yet 


emerged from the laboratory. Scientists could not be blamed if their 


science had not yet discovered vitamins or was not yet prepared to 
synthesize the new drug. 

Because scientists have to develop l ; ; 
cepts before their results can become efficient regulators of practice, their 
Concerns for a time may appear to be remote from practical affairs, and 
Some of their disputes will seem to be quibbles over distinctions that do 
Not matter, All this suggests the need for patience and tolerance toward 
experimentation which pushes back the boundaries of the known and 
toward theory construction which attempts to sharpen the conceptual 
tools with which scientists can work. ; 

One faulty interpretation of the xelationship Meter l pure a 
applied science is to be avoided. This is the interpretation aH applica- 
tions, if they are to have any verifiable naa ae = ai is a 
Pure science ready to be applied. A corollary o this Hi — be 
the confident hope that once eg) eae F ciil aeo g ‘ie 
Will follow automatically. Neither the pe same n ae tid 
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ing would rest in part upon the findings in the experimental studies of 
learning, but it would consist in much more than the making of sug- 
gestions on the basis of general principles. There must finally be experi- 
mental testing in the school, or on the playground, or in the shop—wher- 
ever the application is to be made. 

If in the end a principle has to be experimented upon at the point 
of application, it might be asked, why do laboratory research at all? If 
what we do with animals has to be repeated with children, why not 
work with children in the first place? There are several answers. For one 
thing, we do some things with animals which we cannot do with children. 
For another thing, we make many false starts in the laboratory, and it 
would be wasteful to subject school children to all the things we try 
in the laboratory. Again, in the laboratory we are able to control condi- 
tions as we are not able to control them in the school. A principle once 
discovered in a better controlled situation can be validated in a less-well- 
controlled one.** Even though the sole aim of learning experimentation 
were practical, it might prove economical to work in the laboratory in 
order to find the leads worth testing in practice, 

Research on practical problems, if well done, not only adjusts theory 
to practice, but contributes to substantiating, refuting, or extending 
theoretical knowledge. It is a mistake to distinguish too sharply between 
pure and applied research, if both are good research. It is true, however, 
that much applied research is necessarily concerned with problems bound 
to specific times and places, and so lacks the universality of pure re- 
search, It is these local and temporary features which lead to the judg- 
ment that applied research is inferior, and tend to reduce the prestige 
of the applied scientist relative to the pure scientist. When the relation- 
ships between pure and applied research are properly worked out, the 
requisite skills are of a high order however the labor is divided, and there 
is no reason why prestige should not be more equitably distributed. 
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Functional analysis (Skinner), 96 
Functionalism, 8, 328-366, 464-465, 479 
adequate response in, 340-341 
characteristics of, 859-362 
Dewey’s views, 329-331 
distributed practice theory, 350-362 
history of, 328-329 
motivation role in, 380, 333, 387-340 
orientation of, 349-350 
preferences of, 333-336 
rate-of-learning factors, 341-343 
as theory of learning, 337-348, 857, 
363-366 
transfer of training and retention in, 
344-348 
typical problems of learning and, 362- 
365 
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on punishment, 60-62 
puzzle-box experiments and, 64-69 
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Generalization, 51, 52 . 
Generalization-differentiation theory 


(Gibson), 350-353 
Generalized reinforcers, 95-96 
Generalized stimulus interrelationships 
(Hull), 139-142 
Geometry, in Lewin’s theory, 262-263 
Gestalt theory, 222-255, 328, 407 
adhesion principle rejection, 261 
drill versus understanding, 249-251 
Hull’s views and, 142 
insightful learning, 233-240 
Lewin’s views and, see Lewin’s views 
memory change, 243-246 
new learning and, 232 
organization, laws of, 226-229 
perception and learning interrelation- 
ship, 254-255 
phenomenological standpoint, 253-254 
productive thinking, 240-243 
repetition in, 232-233 
le of past experience in, 229-232 
Thorndike’s views and, 28 
Tolman’s views and, 187 
typical problems of learning and, 252- 
253 
Gibson’s views, 350-353, 355, 361 
Goal gradient (Hull), 153-160 
Goal reaction (Hull), 148-150 
Goals: 
Lewin’s views on, 276-278 
perceptual, 466-467 
see also Rewards 
Goal-set (Woodworth), 338-339 
Good continuation, law of, in gestalt the- 
ory, 228-229 
Gottschaldt’s experiment, 230 
Gradient of reinforcement (Hull), 124, 
153-160 
Gradients of generalization, 140, 442, 444- 
445 
Group dynamics (Lewin), 286 
Guidance (Melton), 340 
Gulliksen’s rational equation, 374 
Guthrie’s views, 2, 8, 48, 53-80, 87-88, 
100, 114, 328, 405, 452, 476 
association by contiguity, law of, 53-57 
Carr’s views and, 339 
clarifying research, 70-76 
Estes’ views and, 74, 389-392 
gestalt theory and, 225, 233, 239 
Hull’s views and, 127-131, 137, 175 
on intentions, 62-63 
invulnerability aspect, 78-79 
on latent learning, 210 
Lewin’s views and, 261, 276 
on motives, 57-58 


on reinforcement, 408 

on reward, 58-60 

Sheffield’s views and, 411 

simplicity aspect, 79-80 

table of experimental results bearing on, 
(2 

Tolman’s 
449 

on typical problems of learning, 76-78 

Woodworth’s views and, 339 


views and, 207-210, 217, 


Habit-family hierarchy (Hull), 124, 160- 
161 
Habits: 
acquisition theories, 10 
breaking of (Guthrie), 55-57 
expectancy versus (Tolman), 197-201 
reinforcement and (Hull), 128-182 
Habit strength: 
fixation as (Sears), 302-303 


repetition-compulsion and (Freud), 
294-297 
Hallucinations, 292 
Harlow’s drive theory, 430-431 


Hebb’s neuropsychological theory, 452, 
454-455 
Hebb’s role-of-incentive theory, 430 
Hodological psychology, 262-263 
Hovland’s views, 354-355 
information-theory, 382 
Hull’s views, 2, 8, 78, 102, 121-182, 328, 
446, 449, 452, 459-461, 464, 465, 
470, 471, 476 
adaptive automatic behavior mecha- 
nisms listed, 176-177 
adient-abient conflict, 166-173 
background of, 121-125 
criticism of, 180-182 
drive in, 423, 426-429 
Freud’s views and, 295 
functionalism and, 360 
gradient of reinforcement, 153-160 
Guthrie’s views and, 121, 127-181, 137, 
175 
habit-family hierarchy, 124, 160-161 
on individual differences, 147-148 
on inhibition, 187-139 
on intermediate mechanisms, 152-161 
on latent learning, 165-166 
Lewin’s views and, 178, 266-271, 276, 
279 
mathematical treatment of, 368, 372, 
374, 394, 398 
Melton’s views and, 340 
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Miller's views and, 412 

postulates, 125-126 

predictive value, 173-174 

reaction potential, 132-137, 142-147, 


178-180 
on reinforcement, 122-128, 177, 208, 
408, 409 


requirements for, 125 
response alternation in trial-and-error 
learning, 161-165 
on rote memorization, 354 
Seward’s views and, 420, 421 
Skinner’s views and, 121, 131, 137, 144 
Spence’s views and, 415-419, 444 
on stimuli interaction, 139-142 
system summarized, 150-152 
Thorndike’s views and, 121, 161 
Tolman’s views and, 132, 189, 196, 
202-212, 217, 218 
on typical problems of learning, 174- 
177 
Watson’s views and, 121 
Woodworth’s views and, 338 
Human subject experiments: 
affective factors in recall, 313-321 
in distributed practice, 350 
doll-play, 190 
expectancy versus habit (Humphreys), 
198-200 
experience, role of, 230 (Gottschaldt), 
231 (Wulf) 
“incidental learning” experiments, 214- 
215 
infant experiments, 49 
memory, 244 (Hanawalt), 245 (Bart- 
lett) (Hebb and Foord), 280-284 
(Zeigarnik) 
nonsense-syllable, 258-260 
problem-solving, 240-241 
recognition thresholds, 318-321 
retroactive inhibition (Müller), 248 
rewards and punishments (Thorndike), 
27 , 
rote memory versus understanding ( Ka- 
tona), 249-251 N 
“spontaneous recall ( 
Restorff), 246-248 ai 
Hunter’s symbolic process, */ 
Hypothesis, provisional expectancy as, 200 
“Hypothesis” behavior (Tolman and 
Krech), 470 
Hypothetical constructs, 


Köhler and von 


12-13, 218-219 


Identical-elements theory of transfer, 24- 


25 
Imitation (Melton), 340 
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Imprinting, 3-4 
Incentives: 

perceived, 166-167 

positive role of, 427-433 
Incentive value (Rotter), 451 
Individual differences: 

functionalist view, 358 

Hull's view, 147-148 
Information-theory model, 377-383 
Inhibition, in Hull’s system, 137-139 
Initial discovery (functionalism), 340- 

341 

Insight: 

concept of, 222, 224-225 

functionalist view, 340-341 

in gestalt theory, 255 

in problem-solving, 10-11 

Thorndike’s theory, 45 
Insightful learning: 

characteristics of, 234-237 

criteria of, 237-240 

see also Gestalt theory 
Instincts, 3-4 i 
Instrumental conditioning, 52, 153, 408 
Intelligence, Thorndike’s theory of, 24-25 
Intensity theory (Carr), 407 . 
Intentions: 

Guthrie's view, 62-63 

Lewin’s view, 262 
Intermediate mechanisms (Hull), 152-161 
Intermittent reinforcement, 90-94 
Interval reinforcement, 90-94 
Intervening variables, 12-13, 122 

Tolman’s views on, 187-189, 218-219 
Introspection, as scientific method, 49 
Inventive ideation, 475-479 
Inventive learning, 203-206 
Isomorphism ( kohler), 254 


Jenkins’ spread-of-effect experiment, 38 


Katona’s views, 231, 249-251 
Kinesthesis, 54 
Koffka’s views, 217, 222-255 
Kohler’s views, 222-255, 47 
on discrimination learning, 434, 435 
physiological support for, 452 
Krech’s view, 470 
on discrimination learning, 434, 435 


Latency period, 309-310 
Latent nig 195-197, 416, 420 
experimental reports, 211-214 
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Latent learning (continued ) 
Hull’s views on, 165-166 
Tolman’s views on, 208-215 
Learning, passim 
avoidance, 61 
as behavior, 107-108 
definitions of, 2-6 
experimental program, 480-488 
illustrations of, 2-3 
obstacles to, 310 
one-trial, 88 
problems of, see Typical problems of 
learnin 
as EE to stages in development 
(psychoanalysis), 307-310 
schoolroom, 106-107 
sign, see Tolman’s view 
theories of, see Learning theory 
Thorndike’s position on, 42-43 
Learning curves, in mathematical form, 
370-375 
Learning theory, passim 
current trends, 407-456 
directions of, 458-460 
early systems listed, 407 
Freud’s influence, 290-325 
functionalism, 328-366 
gestalt, 222-255 
Guthrie’s, 48, 53-80 
Hull’s views, 121-182 
learning capacity, 461-462 
Lewin’s field theory, 258-288 
mathematical expression of, 368-405 
model use in, 375-385, 394-398 
naturalistic observations, 480-481 
need-drive-incentive pattern, 422-433 
neurophysiological interpretations, 452- 
455 
operant conditioning (Skinner), 82-119 
physiological approach, 481-482 
practical aspects, 485-487 
problems confronting, 6-8, 202-203 
psychoanalysis and, 307-321 
psychodynamic, 323-324 
RRT and, 1-2, 487-488 
pure and applied research in, 488-490 
simple conditioning, 462-463 
standardization in, 482-484 
stimulus-response versus cognitive, see 
separate headings 
Thorndike’s views, 15-45 
Tolman’s views, 185-220 
Watson’s views, 48-53 
Leeper’s views, 207, 268-271 
Lewin’s views, 8, 258-288, 328, 471, 480, 
481 
Ach controversy, 258-262 
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on cognitive structure, 273-274 

distance and direction in, 267-268 

on escape, 61 

gestalt theory and, 225 

Guthrie’s views and, 261, 276 

Hull’s views and, 173, 266-271, 276, 
279 

Leeper’s views and, 268-271 

life-space concept, 263-266 

mathematical treatment of, 262-263, 
368 

Melton’s views and, 340 

on memory, 280-284 

on motivation, 279-280, 469 

psychological tension systems, 260-262 

on reward and punishment, 274-276 

Skinner's views and, 261 

on success and failure, 276-278 

time dimension in, 271-273 

Tolman’s views and, 185, 189, 206, 217 

on typical problems of learning, 284- 
286 

valence concept, 266-267 

Life-space concept (Lewin), 263-268 
Linear-operator models (Bush and Mos- 

teller), 394-398 


McClelland’s theory of affective arousal, 
432-433 
MacCorquodale and Meehl’s postulate 
system, 446-450, 459, 465 
McGeoch’s views, 332, 334, 841, 344, 
350, 351, 362-865, 476 
Maier’s views, 473, 474, 477-479 
“Mand” function, 107-108 
Martin’s spread-of-effect experiment, 32 
Massing-of-trials study (Ericksen), 358 
Mathematical models, 368-405 
calculus applied to, 886-389 
curve-fitting, 869-370 
drive level and delayed reinforcement 
study (Spence), 439-441 
functionalism and, 366 
improvement of theory through, 401- 
403 
learning curves, 370-375 
probability mathematics as basis of, 
389 
stochastic, 388-401 
Mathematics: 
learning models and, see Mathematical 
models 
Lewin’s use of geometry, 262-263 
scientific advances and, 403-405 
Maturation, learning versus, 4 
Mead’s views, 330-331 
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Melton’s views, 334-337, 340-341, 344- 
848, 364, 365 
Memory, 8 
Lewin’s views on, 280-284 
drill versus understanding, 249-255 
Freud’s views on, 300-302 
in gestalt theory, 230 
repression and recall, 813-318 
systematic change in, 243-246 
sce also Reinforcement and Retention 
Miller's stimulus-intensity reduction the- 
ory, 411-414, 430 
Miniature systems, functionalist 
366 
Mixed-schedule reinforcement, 93-94 
Models, learning theory use of, 8, 375- 
385 
Molar: 
Hull’s use of term, 152 
Tolman’s use of term, 185-186 
Molar behavior, 1-2, 186-187, 216-217 
Molar stimulus trace (Hull), 127-128 
Molecular behavior, 1 
Monkey experiments: 
curiosity (Harlow), 430-431 
“learning sets” (Harlow), 438-439 
reward expectancy (Tinklepaugt); 192 


view, 


Montgomery's exploratory-drive hypothe- 
sis, 431 
Morgan's (Lloyd) canon, 224, 476 


Mosteller, see Bush and Mosteller 
Motivation, 468-469 
Freud’s views on, 322 
in functionalism, 330, 333, 337-340 
in gestalt theory, 252 
Guthrie’s views on, 57-58, 76 
Hull’s views on, 128-180, 175 
Lewin’s views on, 258, 279-280, 284, 
285 
Skinner’s views on, 114 
Thorndike’s views on, 43 
Tolman’s views on, 201-202, 216 
two-moment theory, 420-422 
see also Rewards and Punishment 
Motor patterns (Tolman), 207-208 
Mowrer’s views, 12, 413-416, 430 


Need-drive-incentive pattern, 422-433 

Need system (Tolman), 189 

Nervous system, role in learning, 5 

Neurones, 18 

Neurophysiology, learning theory and, 
452-455, 481-482 

Neuropsychological model (Hebb), 451, 
453-454 


Neurosis: 
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aggression and its displacement, 304- 
805 
art and, 311 
Non-continuity theory, in discrimination 
learning, 434-442 
Non-reinforcement, 51 
Nonsense-syllable experiments, 40, 258- 
260, 357-358 


Ogive, normal, 370 

One-trial learning (Skinner), 88 

Operant conditioning (Skinner), 82-119, 
408 

Operant strength, 86 

Organismic psychology, 225 

Organization, laws o (Koffka), 226-229 

Oscillation, behavioral, 142-144 

Osgood’s views, 346-348, 365 

Overlearning, 472-475 


Pavlov’s views, 51, 408 
classical conditioning, 52, 153, 408 
Perceived incentives, 166-167 
Perception: 
Dewey’s interpretation, 331 
gestalt psychology in field of, 226, 254- 
255 


pas 466-467 
earning and, 465-468 
Performance, learning and, 4-5 
“Peripheralist” psychology, 9-10, 13, 187, 
464-465 
Perseveration theory, 355-356 
Person, in life space, 265-266 
Personality development, child-training 
and, 808-310 
Phase sequence (Hebb), 454-455 
Phenomenal observation, in gestalt the- 
ory, 253-254 
Physiology, in approach to learning the- 
ory, 481-482 
Pigeon experiments (Skinner), 88-94 
Place learning, 192-195 
Pleasure-pain theory, 407 
Pleasure principle (Freud), 291-293 
Polarity ( Thorndike), 29 
Postremity prediction (Voeks), 70-72 
Postulate system (MacCorquodale and 
Meehl), 125-126 (Hull), 446-450 
Practice: 
Freud’s views on, 321-322 
functionalist views on, 350-362 
in gestalt theory, 252 
Guthrie’s views on, 55, 76 
Hull’s views on, 174 
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Practice (continued) 
Lewin’s views on, 284 
Skinner's views on, 114 
Thorndike’s views on, 25-26, 42-43 
Tolman’s views on, 215-216 
see also Reinforcement 
Praégnanz, law of, 227-229, 231 
Prediction, in experiment, 71-72 
Preparatory reaction (functionalist), 337- 
338 
Prepotency of elements, 22 
Prepotent-response theory 
410-411 
Proactive and retroactive inhibition, 344 
Probabilistic theories (model builders), 


(Sheffield), 


Probability-bias hypothesis (Smith), 35- 
36 
Problem-box experiments, 16 
Guthrie’s theory, 64-69 
Skinner box, 82 
Problem-solving, 5-6, 475-479 
Process dimensions (Melton), 334-335 
Progressive education, 330 
Provisional try, 469-472, 474-475 
Proximity, law of, in gestalt theory, 228 
Pseudo-reflex, 116 
Psychoanalysis, 328 
laboratory exploration of, 307-321 
learning and, 290-325 
in study of thinking, 310-312 
Psychodynamics, 8 
Psychological tension systems (Lewin), 
260-262 
Psychology: 
developmental, 308-310 
learning theory study, 1-14, 487-488 
Psychotherapy, 108-109, 312-313 
Punch-board maze experiments (Voeks), 
71-72 
Punishment: 
Guthrie’s views on, 60-62, 76-77 
intermittent, 111-112 
Lewin’s views on, 274-276 
practical aspects, 112-113 
psychotherapy and, 108-109 
Skinner’s views on, 89, 97-98, 109-113 
see also Reward and Motivation 
Purposive behavior, 328 
see also Tolman’s views 
Pursuit learning, 354 


Rashevsky’s models, 386 

Rate of learning, functionalist view, 341- 
343 

Rat experiments: 
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adient-abient behavior 
Miller), 168-170 
differential cues (Bailey), 425-426 
discrimination learning (Ehrenfreund ), 
436-437 
drive interaction 
Hayes), 426 
drive reduction (Heron) (Miller and 
Kessen), 423-424 
expectancy versus habit (Brunswik), 
177 
exploratory drive (Montgomery), 431 
hunger drive (Campbell and Kraeling), 
427 
intermediate mechanism (Arnold and 
Hull), 155-159 
latent learning, 196-197, 209-211 
lever-pressing, 52 
Lewin’s views, 268-271 
place learning (Tolman), 193-195 
on positive reinforcement (Olds and 
Milner), 431-432 
prepotent-response 
411 
relational learning (Grice) (Baker and 
Lawrence), 443 
relational learning (Lawrence and De 
Rivera), 442-444 
relevant and irrelevant stimuli (Law- 
rence), 437-438 
response alternation (Hull), 162-165 
Skinner's, 82, 94-119 
stimulus-intensity reduction 
and Kessen), 411-412 
Rational curve-fitting, 369-370, 372-375 
Ratio reinforcement, 90-94 
Reaction potential (Hull), 132-187, 142- 
147, 178-180 
Readiness, law of (Thorndike), 18-19 
Reading, 19, 44 
Reality principle (Freud), 293-294 
Reasoning, learning and, 5-6 
Recall, see Memory 
Recognition threshold, repression and, 
318-321 
Recovery, spontancous, 51, 52 
Reflex-arc concept (Dewey), 329 
Reflexes, 3 
conditioned, 50-53 
Skinner’s views on, 86, 100-105, 115- 
116 
unconditioned, 50 
Reflex reserve, 86 
Regression (Freud), 303-304 
Reinforcement, 11-12, 51, 52, 85-86, 469- 
472 
confirmation versus (Tolman ), 201-202 


(Brown and 


(Verplanck and 


(Sheffield), 410- 


(Miller 


a 
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extinction as measure of, 86-89 
habit-strengthening influence, see Hull’s 
views 
Hull’s views on, 122-123, 128-130, 153- 
160, 177, 208 

interpretations of, 408-422 

intervals of, 90-94 

latent learning and, 208-215 

pleasure principle and (Freud), 291- 

293 

ositive and negative primary, 89-90 

role of (Spence), 414-419 

secondary, 94-96, 129-130, 148-150, 

410 

see also Motivation 
Repetition, in gestalt theory, 232-233 
Repetition-compulsion principle (Freud), 

294-297 

Repression: 

Freud’s view, 300-302 

recall and, 318-318 

recognition threshold and, 318-321 
Research, pure versus applied, 488-490 
Respondent behavior, 82-85 
Respondents, 83 
Responding organism (Hull), 127-128 
Response, 10 

by analogy (Thorndike), 22 

altematisa in (Hull), 161-165 

conditioned, 50-53, 72-74 

differentiation of (Skinner), 100-101 

functionalist views of, 340-341 

initial discovery and, 340 

multiple (Thorndike), 21 

novelty of, 100 

punishment and, 110-111 2 

reaction potential and (Hull), 182 

stimulus and, see Stimulus-response 

(S-R) theories 
Response-completeness t 
407 

Response generalization (Hull), 144 
Response probability (Voeks), 70, 74 
Retention: 


curve of (Ebbinghaus), 369 
drill versus understanding, 249-251 


functionalist views, 344-348, 357-358 
gestalt theory and, 246-248 
see also Reinforcement 

Retroactive inhibition, 344 

Reward: 
Guthrie’s views on, 58-60, 76 
Lewin’s views on, 274-276 
Thorndike’s views on, see Effect, law of 
see also Motivation 


Reward expectancy, 192 
Robinson’s Es: 332, 342-348, 862-864 


heory (Peterson Ja 
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gestalt theory and, 225 
Skinner's views and, 343 
Rote memorization, 249-251 
Rotter’s social-learning theory, 450-451 


Schizophrenia, 304 
Schoolroom learning (Skinner), 106-107 
“Set” concept, 21, 338-339 
Seward’s two-moment theory of motivated 
learning, 420-422 
Sheffield’s prepotent-response theory, 410- 
411 
Sheffield’s 
37-38 
Sherrington’s response classes, 409, 464 
Sign-gestalt theory, 185 
Sign learning: 
Mowrer’s, 413 
Tolman’s, 185-220 
Sign-significate theory, 185 
Similarity: 
in gestalt theory, 227-228 
in retroactive inhibition, 344-348 
Simple conditioning, 462-463 
Situational dimensions (Melton), 334- 
335 
Situation-set (Woodworth ), 338-339 
Skaggs-Robinson hypothesis, 344-348 
Skinner box, 82, 162 
Skinner’s views, 8, 12, 138, 82-119, 328, 
452 
achievements of, 118-119 
anti-theory argument, 117-118 
co-ordination with other viewpoints, 
116-117 
on differentiation of a response, 98, 
100-101 
on discrimination of stimuli, 98-100 
on drive, 96-97 
on emotion, 97-98 
functionalist views and, 360, 366 
Hull’s views and, 131, 137, 144 
Lewin’s views and, 261 
mathematics in, 344 
on psychotherapy, 108-109 
on punishment, 97-98, 109-113 
on reflex as unit, 100-105, 115-116 
on reinforcement, 85-96, 408, 409 
on respondent and operant behavior, 
82-85 
Robinson’s views and, 343 
on schoolroom learning, 106-107 
Spence’s views and, 415 
Tolman’s views and, 208 
on typical problems of learning, 113- 
115 


spread-of-effect experiment, 
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Skinner’s views (continued) 
on verbal behavior, 107-108 
Woodworth’s views and, 338 
Smith’s (M. H., Jr.) spread-of-effect ex- 
eriment, 35-36 
Social-learning theory (Rotter), 450-451 
Solution learning (Mowrer), 413 
Specificity doctrine (Thorndike), 43-45 
Spence’s views, 426, 427, 449, 459, 464, 
465 
on discrimination learning, 434-445 
Hull’s views and, 444 
on incentive motivation, 429 
on role of reinforcement, 409, 414-419, 
420, 421 
Spread of effect (Thorndike), 29-42 
S-R theory, see Stimulus-response theory 
Statistical-association model (Estes), 389- 
393 
Statistics, 401-402 
Stephens’s spread-of-effect experiment, 
33-34 
Stereotypy, 67-69, 472-475 
Stimulus: 
discrimination of (Skinner), 98-100 
equivalence and interaction (Hull), 
139-142 
punishment and, 111 
unconditioned, 50 
Stimulus-intensity reduction theory (Mil- 
ler), 411-412 
Stimulus-response (S-R) theories, 9-13 
15, 328, 445-446 
connectionism as, see Connectionism 
Guthrie’s, see Guthrie’s views 
Hull’s, see Hull’s views 
kinds of, 464-465 
mediating processes, 464-465 
perceptions and, 468 
reinforcement in, 408-422 
sign learning and, 191-197 
Skinner’s system, see Skinner’s views 
substantive, 464-465 
Watson’s, 49-53 
Stimulus trace, components table, 152 
Stochastic models, 388-401 
Strength, definition of, 19 
Success, Lewin’s view of, 276-278 
Surface, Osgood’s, 347 
Symbolic constructs, 122 


“Tact” function, 107-108 
“Teachableness,” 2 
Teaching: 
automatic, 106-107 
gestalt theory and, 242 
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Theorem system (Vocks), 70-76 
Theory-of-games model, 383-385 
Therapy, as learning, 312-313 
Thinking: 
as implicit speech, 49 
psychodynamics of, 310-312 
Thirst-drive experiments, 424 
Thorndike’s views, 8, 12, 15-45, 65, 78, 
82, 100, 328, 407, 452, 470, 475, 
476 
capacity laws, 203-204 
connectionism, 15-45 
functionalist views, 363 
gestalt theory and, 222-295, 232 
Hull’s views and, 161 
Skinner’s views and, 115, 119 
specificity doctrine, 43-45 
Tolman’s views and, 203, 204, 217 
typical problems of learning and, 42-43 
Thurstone’s mathematical learning curves, 
372-875, 386, 389 
Tilton’s spread-of-effect experiment, 31- 
33 
Time dimension (Lewin), 271-273 
Time-perspective (Frank), 272 
Tolman’s views, 2, 8, 12, 185-220, 328, 
366, 389, 407, 446, 452, 470, 476, 
477 
Brunswik’s views and, 185 
expectancy versus habit, 197-201 
Guthrie's views and, 207-210, 217, 449 
Hull’s views and, 132, 189, 196, 202- 
212, 217, 218 
on intervening variables, 187-189, 218- 
219 
on latent learning, 208-215 
laws relating to manner of presentation, 
205-206 
laws relating to nature of material, 
204-205 
learned connections and their 1 
206-208 
Lewin’s views and, 185, 189, 206, 217 
` McDougall’s views and, 185 
Melton’s views and, 340 
on molar behavior, 186-187, 216-217 
on motivation, 201-202 
on response learning, 191-197 
Skinner’s views and, 208 
Thorndike’s views and, 203, 204, 217 
spral problems of learning and, 215- 
16 i 


aws, 


Watson’s views and, 185, 186, 219 
Woodworth’s views and, 185, 339 


Topological psychology, 262-263 


see also Lewin’s views 


Trace theory (Koffka), 229-232 
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Transfer: 
Freud’s view of, 822 
functionalist view of, 344-348, 363 
in gestalt theory, 239, 252-253 
Guthrie’s view of, 77 
Hull's view of, 175 
identical-elements theory, 24-25 
Lewin’s view of, 284 
Skinner's view of, 115 
Thorndike’s view of, 43 
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